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TO ALL WHOM IT MAY CONCERN: 

Be it known that We, Lloyd G. Mitchell, Mariano A. Garcia-Blanco, 
citizens of the United States and Madaiah Puttaraju and S. Gary Mansfield, citizens of 
India and Great Britian respectively, residing in the United States, City of Durham, State 
of North Carolina, whose post office addresses are 4500 Highgate Drive, Durham, North 
Carolina 27713, 12 Sanderling Court, Durham, North Carolina 27713, 416 Tall Oaks 
Drive, Durham, North Carolina 27713 and 1005 Prologue Road, Durham, North Carolina 
27712, respectively, have invented an improvement in 

"Methods and Compositions for Use in Spliceosome Mediated RNA Trans-splicing" 

of which the following is a 



SPECIFICATION 



The present application is a continuation-in-part of pending application 
serial number 09/158,863 filed September 23, 1998 which is a continuation-in-part of 
serial number 09/133,717 filed on August 13, 1998 which is a continuation-in-part of 
serial number 09/087,233 filed on May 28, 1998, which is a continuation-in-part of 
pending application serial number 08/766,354 filed on December 13, 1996, which claims 
benefit to provisional application number 60/008,317 filed on December 15, 1995. 

The present invention was made with government support under Grant 
Nos, SBIR R43DK56526-01 and SBIR R44DK56526-02. The government has certain 
rights in the invention. 

1. INTRODUCTION 
The present invention provides methods and compositions for generating 
novel nucleic acid molecules through targeted spliceosomal /raw^-splicing. The 
compositions of the invention include pre-/raw5-splicing molecules (PTMs) designed to 
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interact with a natural target precursor messenger RNA molecule (target pre-mRNA) and 
mediate a rraw^-splicing reaction resulting in the generation of a novel chimeric RNA 
molecule (chimeric RNA). The PTMs of the invention are genetically engineered so as to 
result in the production of a novel chimeric RNA which may itself perform a function, 
5 such as inhibiting the translation of the RNA, or that encodes a protein that complements 
a defective or inactive protein in a cell, or encodes a toxin which kills specific cells. 
Generally, the target pre-mRNA is chosen as a target because it is expressed vsdthin a 
specific cell type thus providing a means for targeting expression of the novel chimeric 
RNA to a selected cell type. The invention further relates to PTMs that have been 

10 genetically engineered for the identification of exon/intron boundaries of pre-mRNA 

molecules using an exon tagging method. In addition, PTMs can be designed to result in 
the production of chimeric RN A encoding for peptide affinity purification tags which can 
be used to purify and identify proteins expressed in a specific cell type.SThe methods of 
the invention encompass contacting the PTMs of the invention with a target pre-mRN A 

1 5 imder conditions in which a portion of the PTM is trans-spliced to a portion of the target 
pre-mRNA to form a novel chimeric RNA molecule. The methods and compositions of 
the invention can be used in cellular gene regulation, gene repair and suicide gene therapy 
for treatment of proliferative disorders such as cancer or treatment of genetic, 
autoinmiune or infectious diseases. In addition, the methods and compositions of the 

20 invention can be used to generate novel nucleic acid molecules in plants through targeted 
splicesomal /ra«^-splicing. For example, targeted /ran^-splicing may be used to regulate 
gene expression in plants for treatment of plants diseases, engineering of disease resistant 
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plants or expression of desirable genes in plants. The methods and compositions of the 
invention can also be used to map intron-exon boundaries and to identify novel proteins 
expressed in any given cell. 



which contain coding regions (exons) and generally also contain intervening non-coding 
regions (introns). Introns are removed from pre-mRNAs in a precise process called 
splicing (Chow et al, 1977, Cell 12:1-8; and Berget, S.M. et al, 1977, Proc. Natl. Acad. 
Sci. USA 74:3 1 7 1 -3 1 75). Splicing takes place as a coordinated interaction of several 

10 small nuclear ribonucleoprotein particles (snRNP's) and many protein factors that 

assemble to form an enzymatic complex known as the spliceosome (Moore et a/. ,1993, in 
The RNA World, R.F. Gestland and J.F. Atkins eds. (Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y.); Kramer, 1996, Annu. Rev. Biochem., 65:367-404; 
Staley and Guthrie, 1998, Cell 92:315-326). 

15 Pre-mRNA splicing proceeds by a two-step mechanism. In the first step, 

the 5* splice site is cleaved, resulting in a "free" 5* exon and a lariat intermediate (Moore, 
M.J. and P.A. Sharp, 1993, Nature 365:364-368). In the second step, the 5' exon is 
ligated to the 3' exon with release of the intron as the lariat product. These steps are 
catalyzed in a complex of small nuclear ribonucleoproteins and proteins called the 

20 spliceosome. 



2. BACKGROUND OF THE INVENTION 



5 



DNA sequences in the chromosome are transcribed into pre-mRNAs 
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The splicing reaction sites are defined by consensus sequences around the 



5' and 3' splice sites. The 5* splice site consensus sequence is AG/GURAGU (where 
A=adenosine, U = uracil, G = guanine, C = cytosine, R = purine and / = the splice site). 
The 3' splice region consists of three separate sequence elements: the branch point or 
5 branch site, a polypyrimidine tract and the 3* splice consensus sequence (YAG). These 
elements loosely define a 3* splice region, which may encompass 100 nucleotides of the 
intron upstream of the 3' splice site. The branch point consensus sequence in mammals is 
YNYURAC (where N = any nucleotide, Y= pyrimidine). The underlined A is the site of 
branch formation (the BPA = branch point adenosine). The 3' splice consensus sequence 

10 is YAG/G. Between the branch point and the splice site there is usually found a 

polypyrimidine tract, which is important in mammalian systems for efficient branch point 
utilization and 3' splice site recognition (Roscigno, R., F. etai, 1993, J. Biol. Chem. 
268:1 1222-1 1229). The first YAG trinucleotide downstream from the branch point and 
polypyrimidine tract is the most commonly used 3' splice site (Smith, C.W. et al, 1989, 

15 Nature 342:243-247). 



molecule, which is termed c/^-splicing. Splicing between two independently transcribed 
pre-mRNAs is termed trans-splicing, Traw^-splicing was first discovered in 
trypanosomes (Sutton & Boothroyd, 1986, Cell 47:527; Murphy et al, 1986, Cell 
20 47:517) and subsequently in nematodes (Krause & Hirsh, 1987, Cell 49:753); flatworms 
(Rajkovic etal, 1990, Proc. Nafl. Acad. Sci. USA, 87:8879; Davis etal, 1995, J. Biol. 
Chem. 270:21813) and in plant mitochondria (Malek et al, 1997, Proc. Naf 1. Acad. Sci. 
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USA 94:553). In the parasite Trypanosoma brucei, all mRNAs acquire a splice leader 
(SL) RNA at their 5' termini by trans-splicing, A 5' leader sequence is also /raw^-spliced 
onto some genes in Caenorhabditis elegans. This mechanism is appropriate for adding a 
single conmion sequence to many different transcripts. 
5 The mechanism of /r^srw^-splicing, which is nearly identical to that of 

conventional c/5-splicing, proceeds via two phosphoryl transfer reactions. The first 
causes the formation of a 2'-5' phosphodiester bond producing a ' Y' shaped branched 
intermediate, equivalent to the lariat intermediate in c/.y-splicing. The second reaction, 
exon Ugation, proceeds as in conventional c/\y-splicing. In addition, sequences at the 3' . 

10 splice site and some of the snRNPs which catalyze the /rara-spHcing reaction, closely 
resemble their counterparts involved in c/^-splicing. 

Tra^^-splicing may also refer to a different process, where an intron of one 
pre-mRNA interacts with an intron of a second pre-mRNA, enhancing the recombination 
of splice sites between two conventional pre-mRNAs. This type of /rara-splicing was 

1 5 postulated to account for transcripts encoding a human inmiunoglobulin variable region 
sequence linked to the endogenous constant region in a transgenic mouse (Shimizu et 
a/., 1989, Proc. Nat'l. Acad. Sci. USA 86:8020). In addition, trans-spYicing of c-myb pre- 
RNA has been demonstrated (Vellard, M. et al. Proc. Nafl. Acad. Sci., 1992 89:251 1- 
2515) and more recently, RNA transcripts from cloned SV40 rran^-spliced to each other 

20 were detected in cultured cells and nuclear extracts (Eul et al, 1995, EMBO. J. 14:3226). 
However, naturally occurring trans-splicing of mammalian pre-mRNAs is thought to be 
an exceedingly rare event 
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In vitro /rara-splicing has been used as a model system to examine the 
mechanism of splicing by several groups (Konarska & Sharp, 1985, Cell 46:165-171 
Solnick, 1985, Cell 42:157; Chiara & Reed, 1995, Nature 375:510; Pasman and Garcia- 
Blanco, 1996, Nucleic Acids Res. 24:1638). Reasonably efficient rra«5-splicing (30% of 
5 c/.y-spliced analog) was achieved between RNAs capable of base pairing to each other, 
splicing of RNAs not tethered by base pairing was further diminished by a factor of 10. 
Other in vitro /raw^-splicing reactions not requiring obvious RNA-RNA interactions 
among the substrates were observed by Chiara & Reed (1995, Nature 375:510), Bruzik 
J.P. & Maniatis, T. (1992, Nature 360:692) and Bruzik J.P. and Maniatis, T., (1995, Proc. 
10 Nat'L Acad. Sci. USA 92:7056-7059). These reactions occur at relatively low frequencies 
and require specialized elements, such as a downstream 5* splice site or exonic splicing 
enhancers. 

In addition to splicing mechanisms involving the binding of multiple 
proteins to the precursor mRNA which then act to correctly cut and join RNA, a third 

15 mechanism involves cutting and joining of the RNA by the intron itself, by what are 

termed catalytic RNA molecules or ribozymes. The cleavage activity of ribozymes has 
been targeted to specific RNAs by engineering a discrete "hybridization" region into the 
ribozyme. Upon hybridization to the target RNA, the catalytic region of the ribozyme 
cleaves the target. It has been suggested that such ribozyme activity would be useful for 

20 the inactivation or cleavage of target RNA in vivo, such as for the treatment of human 
diseases characterized by production of foreign of aberrant RNA. The use of antisense 
RNA has also been proposed as an alternative mechanism for targeting and destruction of 
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specific RNAs. In such instances small RNA molecules are designed to hybridize to the 
target RNA and by binding to the target RNA prevent translation of the target RNA or 
cause destruction of the RNA through activation of nucleases. 

Until recently, the practical application of targeted /raw^-splicing to 
5 modify specific target genes has been limited to group I ribozyme- based mechanisms. 
Using the Tetrahymena group I ribozyme, targeted /raw^-splicing was demonstrated in E. 
coli. coli (SuUenger B.A. and Cech. T.R,, 1994, Nature 341 :619-622) , in mouse 
fibroblasts (Jones, J.T. et al., 1996, Nature Medicine 2:643-648), human fibroblasts 
(Phylacton, L.A. et al. Nature Genetics 18:378-381) and human erythroid precursors (Lan 
10 et al., 1998, Science 280:1593-1596). While many applications of targeted RNA trans- 
splicing driven by modified group I ribozymes have been explored, targeted trans- 
splicing mediated by native mammalian splicing machinery, /.e., spliceosomes, has not 
been previously reported. 



3. SUMMARY OF THE INVENTION 

15 The present invention relates to compositions and methods for generating 

novel nucleic acid molecules through spliceosome-mediated targeted /ram'-splicing. The 
compositions of the invention include pre-/raw^-splicing molecules (hereinafter referred 
to as "PTMs") designed to interact with a natural target pre-mRNA molecule (hereinafter 
referred to as "pre-mRNA") and mediate a spliceosomal ^rara-splicing reaction resulting 

20 in the generation of a novel chimeric RNA molecule (hereinafter referred to as "chimeric 
RNA"). The methods of the invention encompass contacting the PTMs of the invention 
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with a natural target pre-mRNA under conditions in which a portion of the PTM is 
spliced to the natural pre-mRNA to form a novel chimeric RNA . The PTMs of the 
invention are genetically engineered so that the novel chimeric RNA resulting from the 
trans-splicing reaction may itself perform a function such as inhibiting the translation of 
5 RNA, or alternatively, the chimeric RNA may encode a protein that complements a 

defective or inactive protein in the cell, or encodes a toxin which kills the specific cells. 
Generally, the target pre-mRNA is chosen because it is expressed within a specific cell 
type thereby providing a means for targeting expression of the novel chimeric RNA to a 
selected cell type. The target cells may include, but are not limited to those infected with, 

10 viral or other infectious agents, benign or malignant neoplasms, or components of the 

immune system which are involved in autoimmune disease or tissue rejection. The PTMs 
of the invention may also be used to correct genetic mutations found to be associated with 
genetic diseases. In particular, double-rram-splicing reactions can be used to replace 
internal exons. The PTMs of the invention can also be genetically engineered to tag exoh 

15 sequences in a mRNA molecule as a method for identifying intron/exon boundaries in 
target pre-mRNA. The invention further relates to the use of PTM molecules that are 
genetically engineered to encode a peptide affinity purification tag for use in the 
purification and identification of proteins expressed in a specific cell type. The methods 
and compositions of the invention can be used in gene regulation, gene repair and 

20 targeted cell death. Such methods and compositions can be used for the treatment of 

various diseases including, but not limited to, genetic, infectious or autoimmune diseases 
and proliferative disorders such as cancer and to regulate gene expression in plants. 
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4. BRIEF DESCRIPTION OF THE DRAWINGS 
Figure lA. Model of Pre-rra^25-splicing RNA. 

Figure IB. Model PTM constructs and targeted /raw^-splicing strategy. 
Schematic representation of the first generation PTMs (PTM+Sp and PTM-Sp). BD, 
5 binding domain; NBD, non-binding domain; BP, branch point; PPT, pyrimidine tract; ss., 
splice site and DT-A, diphtheria toxin subunit A. Unique restriction sites within the 
PTMS are indicated by single letters: E; EcoRI; X, Xhol; K, Kpnl; P, Pstl; A, Accl; B, 
BamHI and H; Hindlll. 

Figure IC. Schematic drawing showing the binding of PTM+Sp via 
10 conventional Watson Crick base pairing to the pHCG6 target pre-mRNA and the 
proposed cis- and /ran^-splicing mechanism. 

Figure 2A. In vitro /raw^-splicing efficiency of various PTM constructs 
into pHCG6 target. A targeted binding domain and active splice sites correlate with PTM 
/raw^-splicing activity. Full length targeted (pcPTM+Sp), non-targeted (PTM-Sp) and the 
15 splice mutants [Py(-)AG(-) and BP(-)Py(-)AG(-)] PTM RNAs were added to splicing 
reactions containing pHCG6 target pre-mRNA. The products were RT-PCR amplified 
using primers pHCG-F (specific for target pHCG6 exon 1) and DT-5R (complementary 
to DT-A) and analyzed by electrophoresis in a 1.5% agarose gel. 

Figure 2B. In vitro trans-s^\\cmg efficiency of various PTM constructs. 
20 Full length PTM with a spacer between the binding domain and splice site (PTM+Sp), 
PTM without the spacer region (PTM+) and short PTMs that contain a target binding 
domain (short PTM+) or a non-target binding region (PTM-) were added to splicing 
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reactions containing pHCG target pre-mRNA. The products were RT-PCR amplified 
using primers pHCG-F and DT-3. For reactions containing the short PTMs, the reverse 
PGR primer was DT-4, since the binding site for DT-3 was removed from the PTM. 

Figure 3. Nucleotide sequence demonstrating the in vitro trans-spliced 
5 product between a PTM and target pre-mRNA. The 466 bp /ran^-spliced RT-PCR 

product from Figure 2 (lane 2) was re-ampHfied using a 5' biotin labeled forward primer 
(PHCG-F) and a nested unlabeled reverse primer (DT-3R). Single stranded DNA was 
purified and sequenced directly using toxin specific DT-3R primer. The arrow indicates 
the spUce junction between the last nucleotide of target pHCG6 exon 1 and the first 
1 0 nucleotide encoding DT-A. 

Figure 4A. Schematic diagram of the "safety" PTM and variations, 
demonstrating the PTM intramolecular base-paired stem, intended to mask the BP and 
PPT from splicing factors. Underlined sequences represent the pHCG6 intron 1 
complementary target-binding domain, sequence in italics indicate target mismatches that 
1 5 are homologous to the BP. 

Figure 4B. Schematic of a safety PTM in open configuration upon 
binding to the target. 

Figure 4C. In vitro trans-splicing reactions were carried out by incubating 
either safety PTM or safety PTM variants with the pHCG6 target. Splicing reactions 
20 were amplified by RT-PCR using pHCG-F and DT-3R primers; products were analyzed 
in a 2.0% agarose gel. 
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Figure 5. Specificity of targeted ^a«5-splicing is enhanced by the 
inclusion of a safety into the PTM. pHCG6 pre-mRNA (250 ng) and p-globin pre- 
mRNA (250 ng) were annealed together with either PTM+SF (safety) or pcPTM+Sp 
(linear) RNA (500 ng). In vitro /raw^-splicing reactions and RT-PCR analysis were 
5 performed as described under experimental procedures and the products were separated 
on a 2.0% agarose gel. Primers used for RT-PCR are as indicated. 

Figure 6. In the presence of increasing PTM concentration, c/.y-splicing is 
inhibited and replaced by rraw^-splicing. In vitro splicing reactions were performed in the 
presence of a constant amount of pHCG6 target pre-mRNA (100 ng) with increasing 
\ 0 concentrations of PTM (pcPTM+Sp) RNA (52-300 ng). RT-PCR for c/^-spliced and un- 
spliced products utilized primers pHCG-F (exon 1 specific) and pHCG-R2 (exon 2 
specific - Panel A); primers pHCG-F and DT*3R were used to RT-PCR /r^zw^-spliced 
products (Panel B). Reaction products were analyzed on 1 .5% and 2.0% agarose gels, 
respectively. In panel A, lane 9 represents the 60 min time point in the presence of 
1 5 300 ng of PTM, which is equivalent to lane 10 in panel B. 

Figure 7A. PTMs are capable of ^ara-splicing in cultured human cancer 
cells. Total RNA was isolated from each of 4 expanded neomycin resistant HI 299 lung 
carcinoma colonies transfected with pcSp+CRM (expressing non-toxic mutant DT-A) 
RT-PCR was performed using 1 |ig of total RNA and 5* biotinylated PHCG-F and non- 
20 biotinylated DT-3R primers. Single stranded DNA was purified and sequenced. 
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Figure 7B. Nucleotide sequence (sense strand) of the trans-sphcQd 
product between endogenous pHCG6 target and CRM 197 mutant toxin is shown. Two 
arrows indicate the position of the splice junction. 

Figure 8A. Schematic diagram of a double splicing pre-therapeutic 

mRNA. 

Figure SB. Selective trans-splicing of a double splicing PTM. By varying 
the PTM concentration the PTM can be trans-spliccd into either the 5' or the 3' splice site 
of the target. 

Figure 9. Schematic diagram of the use of PTM molecules for exon 
tagging. Two examples of PTMs are shown. The PTM on the left is capable of non- 
specifically trans-splicing into a target pre-mRNA 3' splice site. The other PTM on the 
right is designed to non-specifically trans-splice into a target pre-mRNA 5' splice site. A 
PTM mediated trans-splicing reaction will result in the production of a chimeric RNA 
comprising a specific tag to either the 5' or 3* side of an authentic exon. 

Figure lOA. Schematic diagram of constructs for use in the lacZ knock- 
out model. The target lacZ pre-mRNA contains the 5' fi:agment of lacZ followed by 
pHCG6 intron 1 and the 3' fi-agment of lacZ (target 1). The PTM molecule for use in the 
model system was created by digesting pPTM +SP with PstI and Hindlll and replacing 
the DT-A toxin with PHCG6 exon 2 (pc3.1PTM2). 

Figure lOB. Schematic diagram of restoration of p-Gal activity by 
Spliceosome Mediated RNA Trans-splicing. Schematic diagram of constructs for use in 
the lacZ knock-in model (pc.3.1 lacZ T2). The lacZ target pre-mRNA is identical to that 
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target pre-mRNA used for the knock-out experiments except that it contains two stop 
codons (TAA TAA) in frame four codons after the 3' splice site. The PTM molecule for 
use in the model system was created by digesting pPTM +SP with PstI and Hindlll and 
replacing the DT-A toxin with functional 3' fragment of lacZ. 
5 Figure 1 1 A. Demonstration of cis-and trans-splicing when utilizing the 

lacZ knock-out model. The LacZ splice target 1 pre-mRNA and PTM2 were 
co-transfected into 293T cells. Total RNA was then isolated and analyzed by PGR for 
c/\y-spliced and trans-spliced products using the appropriate specific primers. The 
amplified PGR products were separated on a 2% agarose gel. 
10 Figure 1 IB-G. Assays for p-galactosidase activity. 293 cells were 

transfected with lacZ target 2 DNA alone (panel B) or lacZ target 2 DNA and PTMl 
(panel G). 

Figure 12A. Nucleotide sequence of /raw^-spliced molecule 

demonstrating accurate trans-splicing. 
15 Figure 12B. Nucleotide sequences of the cz\s'-spliced product and the 

rrara-spliced product. The nucleotide sequences were those sequences expected for each 

of the different splicing reactions. 

Figure 13. Gene repair model for repair of the cystic fibrosis 

transmembrane regulator (GFTR) gene. 
20 Figure 14. RT-PGR demonstration of /raw^-splicing between an 

exogenously supplied GFTR mini-gene target and PTM. Plasmids were co-transfected 

into 293 embryonic kidney cells. The primers pairs used for RT-PGR reactions are listed 
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above each lane. The lower band (471 bp) in each lane represents a /ra«5-spliced 
product. The lower band in lane 1 (471 bp) was purified from a 2% Seakem agarose gel 
and the DNA sequence of the band was determined. 

Figure 15. DNA sequence of the trans-spliced product (lane 1, lower band 
5 shown in Figure 14). The DNA sequence indicates the presence of the F508 codon 

(CTT), exon 9 sequence is contiguous with exon 10 sequence, and the His tag sequence. 

Figure 16. Schematic representation of repair of an exogenously supplied 
CFTR target molecule carrying an F508 deletion in exon 10. 

Figure 17. Repair of endogenous CFTR transcripts by exon 10 ..r 
10 replacement using a double splicing PTM. The use of a double splicing PTM permits 
repair of the a 5 08 mutation with a very short PTM molecule. 

Figure 18. Model lacZ target consisting of lacZ 5' exon - CFTR 
mini-intron 9 - CFTR exon 10 (delta 508) - CFTR mini-intron 10 followed by the 
lacZ 3' exon. Binding domains for PTMs are bracketed. 
15 Figure 19. Schematic representation of double-^aw5-splicing PTMs 

designed to restore p-gal function. 

Figure 20. Schematic representation of a double-fran^-splicing reaction 
showing the binding of DSPTM7 with DSCFT1.6 target pre-mRNA. 

Figure 21. Important structural elements of DSPTM7. The double 
20 splicing PTM has both 3* and 5' functional splice sites as well as binding domains. 



Figure 22. Schematic diagram of mutant double splicing PTMs. 



Figure 23. Accuracy of double-/ra«5-splicing reaction. 
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Figure 24. Double-rran^-splicing between the target pre-mRNA and the 
DSPTM7 produces full-length protein. Western blot analysis of total cell ly sates using 
polyclonal anti-p-galactosidase antiserum. 

Figure 25. Precise internal exon substitution between the DSCFT1.6 
target pre-niRNA and DSPTM7 RNA by double-^aw^-splicing produces functionally 
active p-gal protein. Total cell extracts were prepared and assayed for p-gal activity 
using an ONPG assay. 

Figure 26. 3* and 5' splice sites are essential for the restoration of P-gal 
function by double-/ram-splicing reaction. 

Figure 27. Double-/ran.y-splicing: titration of target and PTM. Different 
concentrations of the target and PTM were co-transfected and analyzed for p-gal activity 
restoration. 

Figure 28. Constructs designed to test the specificity of 
double-/ra«5-splicing reaction. 

Figure 29. Specificity of a double-Zrara-splicing reaction. 

Figure 30. 7raw5-splicing repair of the cystic fibrosis gene using a PTM 
that mediates a double-Zraw^-splicing event. 

Figure 3 1 . PTM with a long binding domain masking two splice sites and 
part of exon 10 in a mini-gene target. 

Figure 32. Sequence of a single PGR product showing target exon 9 
correctly spliced to PTM exon 10 (with modified codons) (upper panel), codon 508 in 
exon 10 of the PTM (middle panel) and PTM exon 10 correctly spliced to target exon 1 1 
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(lower panel). The sequence of a repaired target was generated by RT-PCR followed by 
PGR. 

Figure 33. TronS'Splicing repair of the cystic fibrosis gene using aPTM 
that can perform 5' exon replacement. 
5 Figure 34. Schematic diagram of three different PTM molecules with 

different binding domains. 

Figure 35. Schematic diagram of PTM exon 10 with modified codon 
usage to reduce antisense effects with its own binding domain. 

Figure 36. Sequence of cis- and /ran^-spliced products. 
10 Figure 37. Model system for repair of messenger RNAs by trans-splicing, 

(A) Schematic illustration of a defective lacZCF9m splice target used in the present 
study (see Materials and Methods for details). BP, branch point; PPT, polypyrimidine 
tracts; ss, splice sites and pA, polyadenylation signal (B) A prototype PTM showing the. 
key components of the /rara-splicing domain, and the diagrams of various PTMs 
1 5 showing the binding domain length and approximate positions at which they bind to the 
target pre-mRNA. Unique restriction sites within the trans-splicing domain are N, Nhe I; 
S, Sac II; K, Kpn I and E, EcoR V. (C) Schematic diagram showing the binding of a 
PTM through antisense binding and repair of defective /acZ pre-mRNA through targeted 
RNA /raw^-splicing. Expected cis and /raw^-spliced products and the primer binding sites 
20 for Lac-9F, Lac-3R and Lac-5R are indicated. 

Figure 38. Efficient repair of lacZ messenger RNA. Target specific 
primers, Lac-9F (5' exon) and Lac-3R (3' exon) were used to amplify cw-spliced products 
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(lanes 1-6), while; target and PTM specific primers, Lac-9F (5* exon) and Lac-5R 
(3' exon) were used to antiplify /ra«5-spliced products (lanes 7-15). 25-50 ng of total 
RNA was used to measure target cz\y-splicing (lanes 1-6) and 50-200 ng of total RNA was 
used to measure PTM induced RNA /raw^-splicing (lanes 7-12). Lanes 13-15, 25-50 ng 
5 of total RNA fi'om cells transfected with lacZCF9 a control for /raw5-splicing. 

(B) Endogenous mRNA repair by /r^3fn5'- splicing. Lanes 1-3, RNA fi'om cells transfected 
with PTM-CF14; lanes 4-6, PTM-CF22 and lanes 7-9, PTM-CF24. Lane 10, RNA fi'om 
mock-transfected cells and lane 1 1 is a control in which reverse-transcription reaction 
was omitted. 

10 Figure 39. Messenger RNA repair leads to synthesis of fiiU-length 

p-galactosidase. Lane 1, lacZCF9 (posifive control, 5 /^g); lane 2, lacZCF9m target alone 
(25 Aig); lane 3, PTM-CF24 alone (25 //g) and lane 4, lacZCF9m target + PTM-CF24 
(25 /.g). 

Figure 40. Messenger RNA repair by SMaRT produces functional 
15 p-galactosidase. (A) In situ detection of fimctional p-galactosidase produced by trans- 
splicing. 293T cells were either transfected (transient assay) with lacZCF9m target alone 
(panel A) or co-transfected with lacZCF9m target + PTM-CF24 (panel B) expression 
plasmids as described above. 48-hr post-transfection, cells were rinsed with PBS and 
stained in situ for P-gal activity. (B) Repair of a defective lacZ mRNA produces 
20 fimctional p-galactosidase. Target and PTM, extracts from cells transfected with either 
lacZCF9m target or PTM-CF24 plasmid alone, and the rest were from cells co- 
transfected with lacZCF9m target and one of the PTMs as indicated. (C) Endogenous 
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mRNA repair by trans-splicing produces functional p-galactosidase. Stable cells 
expressing an endogenous lacZCF9m pre-mRNA target was transfected with "linear" 
PTMs (PTM-CF14, PTM-CF22 or PTM-CF24) as described above. Following 
transfection, total cell lysate was prepared and assayed for P-gal activity. The results 
5 presented are the average of two independent transfections. 

Figure 41. Messenger RN A repair is specific. (A) Experimental strategy 
to measure non-specific trans-splicing between lacZHCGlm pre-mRNA and "linear" 
PTMs. (B) Extended binding domains enhance the specificity of trans-splicing. 
Lanes 1-3, PTM-CF14; 4-6, PTM-CF22; 7-9, PTM-CF24; 10-12, PTM-CF26 and 13-15, 

10 PTM-CF27. (C) PTMs with very long binding domains are capable of increasing 

specificity. Total cell extract (5 ^1) was assayed in solution for p-gal activity and the 
specific activity was calculated, p-gal activity was normalized to mock and the results 
presented are the average of two independent transfections. Control, extract from cells 
transfected with lacZHCGlm target alone and the rest were co-transfected with 

1 5 lacZHCGlm target and one of the linear PTMs. 

Figure 42. Complete sequence of CFTR PTM 30 (5* exon replacement 
PTM) showing the trans-splicing domain (underlined) and the coding sequence for exons 
1-10 of the CFTR gene. Modified codons in exon 10 are underlined and bold. 
Figure 43 A. 153 base-pair PTM 24 Binding Domain. 

20 Figure 43 B. Complete sequence of CFTR PTM 24 (3' exon replacement 

PTM) showing the trans-splicing domain (underlined) and the coding sequence for exons 
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10-24 of the CFTR cDNA. At the end of the coding is a histidine tag and the translation 
stop codon. 

5. DETAILED DESCRIPTION OF THE INVENTION 
The present invention relates to compositions comprising pre-/ra«5- 
5 splicing molecules (PTMs) and the use of such molecules for generating novel nucleic 
acid molecules. The PTMs of the invention comprise one or more target binding domains 
that are designed to specifically bind to pre-mRNA, a 3' splice region that includes a 
branch point, pyrimidine tract and a 3' splice acceptor site and/or a 5' splice donor site; i:^. 
and one or more spacer regions that separate the RNA splice site from the target binding 
10 domain. In addition, the PTMs of the in vention can be engineered to contain any 
nucleotide sequences such as those encoding a translatable protein product. 

The methods of the invention encompass contacting the PTMs of the 
invention with a natural pre-mRNA under conditions in which a portion of the PTM is 
trans-spliced to a portion of the natural pre-mRNA to form a novel chimeric RN A. The 
1 5 target pre-mRNA is chosen as a target due to its expression v^thin a specific cell type 
thus providing a mechanism for targeting expression of a novel RNA to a selected cell 
type. The resulting chimeric RNA may provide a desired function, or may produce a 
gene product in the specific cell type. The specific cells may include, but are not limited 
to those infected with viral or other infectious agents, benign or malignant neoplasms, or 
20 components of the immune system which are involved in autoimmune disease or tissue 
rejection. Specificity is achieved by modification of the binding domain of the PTM to 
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bind to the target endogenous pre-mRNA. The gene products encoded by the chimeric 
RNA can be any gene, including genes having clinical usefulness, for example, 
therapeutic or marker genes, and genes encoding toxins. 

5.1. STRUCTURE OF THE PRE-^jV^-SPLICING MOLECULES 
5 The present invention provides compositions for use in generating novel 

chimeric nucleic acid molecules through targeted trans-splicing. The PTMs of the 
invention comprise (i) one or more target binding domains that targets binding of the 
PTM to a pre-mRNA (ii) a 3' splice region that includes a branch point, pyrimidine tract 
and a 3' splice acceptor site and/or 5' splice donor site; and (iii) one or more spacer 

10 regions to separate the RNA splice site from the target binding domain. Additionally, the 
PTMs can be engineered to contain any nucleotide sequence encoding a translatable 
protein product. In yet another embodiment of the invention, the PTMs can be 
engineered to contain nucleotide sequences that inhibit the translation of the chimeric 
RNA molecule. For example, the nucleotide sequences may contain translational stop 

15 codons or nucleotide sequences that form secondary structures and thereby inhibit 
translation. Alternatively, the chimeric RNA may function as an antisense molecule 
thereby inhibiting translation of the RNA to which it binds. 

The target binding domain of the PTM may contain multiple binding 
domains which are complementary to and in anti-sense orientation to the targeted region 

20 of the selected pre-mRNA. As used herein, a target binding domain is defined as any 

sequence that confers specificity of binding and anchors the pre-mRNA closely in space 
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so that the spliceosome processing machinery of the nucleus can trans-splice a portion of 
the PTM to a portion of the pre-mRN A. The target binding domains may comprise up to 
several thousand nucleotides. In preferred embodiments of the invention the binding 
domains may comprise at least 10 to 30 and up to several hundred nucleotides. As 
5 demonstrated herein, the specificity of the PTM can be increased significantly by 
increasing the length of the target binding domain. For example, the target binding 
domain may comprise several hundred nucleotides or more. In addition, although the 
target binding domain may be "linear" it is understood that the RNA may fold to form 
secondary structures that may stabilize the complex thereby increasing the efficiency of 

10 splicing. A second target binding region may be placed at the 3' end of the molecule and 
can be incorporated into the PTM of the invention. Absolute complementarity, although 
preferred, is not required. A sequence "complementary" to a portion of an RNA, as 
referred to herein, means a sequence having sufficient complementarity to be able to 
hybridize with the RNA, forming a stable duplex. The ability to hybridize will depend on 

15 both the degree of complementarity and the length, of the nucleic acid (See, for example, 
Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual, 2d Ed., Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, New York). Generally, the longer the 
hybridizing nucleic acid, the more base mismatches with an RNA it may contain and still 
form a stable duplex. One skilled in the art can ascertain a tolerable degree of mismatch 

20 or length of duplex by use of standard procedures to determine the stability of the 
hybridized complex. 
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Where the PTMs are designed for use in intron-exon tagging or for 
peptide affinity tagging, a library of PTMs is genetically engineered to contain random 
nucleotide sequences in the target binding domain. Alternatively, for intron-exon tagging 
the PTMs may be genetically engineered so as to lack target binding domains. The goal 
5 of generating such a library of PTM molecules is that the library will contain a population 
of PTM molecules capable of binding to each RNA molecule expressed in the cell. A 
recombinant expression vector can be genetically engineered to contain a coding region 
for a PTM including a restriction endonuclease site that can be used for insertion of 
random DNA fragments into the PTM to form random target binding domains. The i 

10 random nucleotide sequences to be included in the PTM as target binding domains can be 
generated using a variety of different methods well known to those of skill in the art, 
including but not limited to, partial digestion of DNA with restriction enzymes or 
mechanical shearing of DNA to generate random fragments of DNA. Random binding 
domain regions may also be generated by degenerate oligonucleotide synthesis. The 

1 5 degenerate oligonucleotides can be engineered to have restriction endonuclease 

recognition sites on each end to facilitate cloning into a PTM molecule for production of 
a library of PTM molecules having degenerate binding domains. 

Binding may also be achieved through other mechanisms, for example, 
through triple helix formation or protein/nucleic acid interactions such as those in which 

20 the PTM is engineered to recognize a specific RNA binding protein, /. e. , a protein bound 
to a specific target pre-mRNA. Alternatively, the PTMs of the invention may be 
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designed to recognize secondary structures, such as for example, hairpin structures 
resulting from intramolecular base pairing between nucleotides within an RNA molecule. 

The PTM molecule also contains a 3* splice region that includes a branch 
point, pyrimidine tract and a 3* splice acceptor AG site and/or a 5' splice donor site. 
5 Consensus sequences for the 5* splice donor site and the 3' splice region used in RNA 
splicing are well known in the art (See, Moore, et aL, 1993, The RNA World, Cold 
Spring Harbor Laboratory Press, p. 303-358). In addition, modified consensus sequences 
that maintain the ability to function as 5' donor splice sites and 3' splice regions may be 
used in the practice of the invention. Briefly, the 5' splice site consensus sequence is 

10 AG/GURAGU (where A=adenosine, U^=uracil, G=guanine, C=cytosine, R=purine and 
/=the splice site). The 3' splice site consists of three separate sequence elements: the 
branch point or branch site, a polypyrimidine tract and the 3' consensus sequence (YAG). 
The branch point consensus sequence in mammals is YNYURAC (Y=pyrimidine). The 
underlined A is the site of branch formation. A polypyrimidine tract is located between 

15 the branch point and the splice site acceptor and is important for different branch point 
utilization and 3' splice site recognition. 

Further, PTMs comprising a 3' acceptor site (AG) may be genetically 
engineered. Such PTMs may further comprise a pyrimidine tract and/or branch point 
sequence. 

20 Recently, pre-messenger RNA introns beginning with the dinucleotide AU 

and ending with the dinucleotide AC have been identified and referred to as U12 introns. 
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U12 intron sequences as well as any sequences that function as splice acceptor/donor 
sequences may also be used in PTMs. 

A spacer region to separate the RNA splice site from the target binding 
domain is also included in the PTM. The spacer region can have features such as stop 
5 codons which would block any translation of an unspliced PTM and/or sequences that 
enhance /raw5'-splicing to the target pre-mRNA. 

In a preferred embodiment of the invention, a "safety" is also incorporated 
into the spacer, binding domain, or elsewhere in the PTM to prevent non-specific trahs- 
splicing. This is a region of the PTM that covers elements of the 3' and/or 5* splice site of 

10 the PTM by relatively weak complementarity, preventing non-specific trans-splicing. 
The PTM is designed in such a way that upon hybridization of the binding /targeting 
portion(s) of the PTM, the 3* and/or 5 'splice site is uncovered and becomes fully active. 

The "safety" consists of one or more complementary stretches of cis- 
sequence (or could be a second, separate, strand of nucleic acid) which weakly binds to 

15 one or both sides of the PTM branch point, pyrimidine tract, 3' splice site and/or 5' splice 
site (splicing elements), or could bind to parts of the splicing elements themselves. This 
"safety" binding prevents the splicing elements from being active (i.e. block U2 snRNP 
or other splicing factors from attaching to the PTM splice site recognition elements). The 
binding of the "safety" may be disrupted by the binding of the target binding region of the 

20 PTM to the target pre-mRNA, thus exposing and activating the PTM splicing elements 
(making them available to trans-splice into the target pre-mRNA). 
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A nucleotide sequence encoding a translatable protein capable of 
producing an effect, such as cell death, or alternatively, one that restores a missing 
function or acts as a marker, is included in the PTM of the invention. For example, the 
nucleotide sequence can include those sequences encoding gene products missing or 
5 altered in known genetic diseases. Alternatively, the nucleotide sequences can encode 
marker proteins or peptides which may be used to identify or image cells. In yet another 
embodiment of the invention nucleotide sequences encoding affinity tags such as, HIS 
tags (6 consecutive histidine residues) (Janknecht, et al., 1991, Proc. Natl. Acad. Sci. 
USA 88 : 8972-8976), the C-terminus of glutathione-S-transferase (GST) (Smith and 

10 Johnson, 1986, Proc. Natl. Acad. Sci. USA 83:8703-8707) (Pharmacia) or FLAG (Asp- 
Tyr-Lys-Asp-Asp-Asp-Lys) (Eastman Kodak/IBI, Rochester, NY) can be included in 
PTM molecules for use in affinity purification. The use of PTMs containing such 
nucleotide sequences results in the production of a chimeric RNA encoding a fiision 
protein containing peptide sequences normally expressed in a cell linked to the peptide 

1 5 affinity tag. The affinity tag provides a method for the rapid purification and 

identification of peptide sequences expressed in the cell. ' In a preferred embodiment the 
nucleotide sequences may encode toxins or other proteins which provide some fimction 
which enhances the susceptibility of the cells to subsequent treatments, such as radiation 
or chemotherapy. 

20 In a highly preferred embodiment of the invention a PTM molecule is 

designed to contain nucleotide sequences encoding the Diphtheria toxin subunit A 
(Greenfield, L., et al., 1983, Proc. Natl. Acad. Sci. USA 80: 6853-6857). Diphtheria 
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toxin subunit A contains enzymatic toxin activity and will function if expressed or 
delivered into human cells resulting in cell death. Furthermore, various other known 
peptide toxins may be used in the present invention, including but not limited to, ricin, 
Pseudomonus toxin, Shiga toxin and exotoxin A. 

Additional features can be added to the PTM molecule either after, or 
before, the nucleotide sequence encoding a translatable protein, such as polyadenylation 
signals or 5' splice sequences to enhance splicing, additional binding regions, "safety "- 
self complementary regions, additional splice sites, or protective groups to modulate the 
stability of the molecule and prevent degradation. 

Additional features that may be incorporated into the PTMs of the 
invention include stop codons or other elements in the region between the binding 
domain and the splice site to prevent unspliced pre-mRNA expression. In another 
embodiment of the invention, PTMs can be generated with a second anti-sense binding 
domain downstream from the nucleotide sequences encoding a translatable protein to 
promote binding to the 3* target intron or exon and to block the fixed authentic cis-5' 
splice site (U5 and/or Ul binding sites). 

PTMs may also be generated that require a double-Zrara-splicing reaction 
for generation of a chimeric /raw^-spliced product. Such PTMs could be used to replace 
an internal exon which could be used for RNA repair. PTMs designed to promote two 
/r^afw^-splicing reactions are engineered as described above, however, they contain both 5* 
donor sites and 3* splice acceptor sites. In addition, the PTMs may comprise two or more 
binding domains and splicer regions. The splicer regions may be place between the 
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multiple binding domains and splice sites or alternatively between the multiple binding 
domains. 

Further elements such as a 3' hairpin structure, circularized RNA, 
nucleotide base modification, or a synthetic analog can be incorporated into PTMs to 
5 promote or facilitate nuclear localization and spliceosomal incorporation, and intra- 
cellular stability. 

Additionally, when engineering PTMs for use in plant cells it may not be 
necessary to include conserved branch point sequences or polypyrimidine tracts as these 
sequences may not be essential for intron processing in plants. However, a 3* splice 

10 acceptor site and/or 5' splice donor site, such as those required for splicing in vertebrates 
and yeast, will be included. Further, the efficiency of splicing in plants may be increased 
by also including UA-rich intronic sequences. The skilled artisan will recognize that any 
sequences that are capable of mediating a trans-splicing reaction in plants may be used. 

The PTMs of the invention can be used in methods designed to produce a 

15 novel chimeric RNA in a target cell. The methods of the present invention comprise 

delivering to the target cell a PTM which may be in any form used by one skilled in the 
art, for example, an RNA molecule, or a DNA vector which is transcribed into a RNA 
molecule, wherein said PTM binds to a pre-mRNA and mediates a ^rara-splicing reaction 
resulting in formation of a chimeric RNA comprising a portion of the PTM molecule 

20 spliced to a portion of the pre-mRNA. 

5.2. SYNTHESIS OF THE TRANS-SFLICING MOLECULES 
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The nucleic acid molecules of the invention can be RNA or DNA or 
derivatives or modified versions thereof, single-stranded or double-stranded. By nucleic 
acid is meant a PTM molecule or a nucleic acid molecule encoding a PTM molecule, 
whether composed of deoxyribonucleotides or ribonucleosides, and whether composed of 
5 phosphodiester linkages or modified linkages. The term nucleic acid also specifically 
includes nucleic acids composed of bases other than the five biologically occurring bases 
(adenine, guanine, thymine, cytosine and uracil). 

The RNA and DNA molecules of the invention can be prepared by any 
method known in the art for the synthesis of DNA and RNA molecules. For example, the 

10 nucleic acids may be chemically synthesized using commercially available reagents and 
synthesizers by methods that are well knovm in the art (see, e.e., Gait, 1985, 
Oligonucleotide Synthesis: A Practical Approach, IRL Press, Oxford, England). 
Alternatively, RNA molecules can be generated by in vitro and in vivo transcription of 
DNA sequences encoding the RNA molecule. Such DNA sequences can be incorporated 

15 into a wide variety of vectors which incorporate suitable RNA polymerase promoters 
such as the T7 or SP6 polymerase promoters. RNAs may be produced in high yield via 
in vitro transcription using plasmids such as SPS65 (Promega Corporation, Madison, 
WI). In addition, RNA amplification methods such as Q-p amplification can be utilized 
to produce RNAs. 

20 The nucleic acid molecules can be modified at the base moiety, sugar 

moiety, or phosphate backbone, for example, to improve stability of the molecule, 
hybridization, transport into the cell, etc. For example, modification of a PTM to reduce 
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the overall charge can enhance the cellular uptake of the molecule. In addition 
modifications can be made to reduce susceptibility to nuclease degradation. The nucleic 
acid molecules may include other appended groups such as peptides (e.g., for targeting 
host cell receptors in v/vo), or agents facilitating transport across the cell membrane (see, 
5 e.g. , Letsinger et al, 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556; Lemaitre et al, 
1987, Proc. Natl. Acad. Sci. 84:648-652; PCT Publication No. W088/09810, published 
December 15, 1988) or the blood-brain barrier (see, e.g., PCT Publication No. 
W089/10134, published April 25, 1988), hybridization-triggered cleavage agents. (See, 
e.g. , Krol et aL, 1988, BioTechniques 6:958-976) or intercalating agents. (See, e.g., Zon, 

10 1988, Pharm. Res. 5:539-549). To this end, the nucleic acid molecules may be 

conjugated to another molecule, ^.g., a peptide, hybridization triggered cross-linking 
agent, transport agent, hybridization-triggered cleavage agent, etc. Various other well- 
known modifications to the nucleic acid molecules can be introduced as a means of 
increasing intracellular stability and half-life. Possible modifications include, but are not 

1 5 limited to, the addition of flanking sequences of ribo- or deoxy- nucleotides to the 5* 
and/or 3' ends of the molecule. In some circumstances where increased stability is 
desired, nucleic acids having modified intemucleoside linkages such as 2'-0-methylation 
may be preferred. Nucleic acids containing modified intemucleoside linkages may be 
synthesized using reagents and methods that are well known in the art (see, Uhlmarm et 

20 a/., 1990, Chem. Rev. 90:543-584; Schneider etal, 1990, Tetrahedron Lett. 31:335 and 
references sited therein). 
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The nucleic acids may be purified by any suitable means, as are well 
known in the art. For example, the nucleic acids can be purified by reverse phase 
chromatography or gel electrophoresis. Of course, the skilled artisan will recognize that 
the method of purification will depend in part on the size of the nucleic acid to be 
5 purified. 

In instances where a nucleic acid molecule encoding a PTM is utilized, 
cloning techniques known in the art may be used for cloning of the nucleic acid molecule 
into an expression vector. Methods commonly known in the art of recombinant DNA 
technology which can be used are described in Ausubel et al (eds.), 1993, Current 

10 Protocols in Molecular Biology, John Wiley & Sons, NY; and Kriegler, 1990, Gene t 
Transfer and Expression, A Laboratory Manual, Stockton Press, NY. 

The DNA encoding the PTM of interest may be recombinantly engineered 
into a variety of host vector systems that also provide for replication of the DNA in large 
scale and contain the necessary elements for directing the transcription of the PTM. The 

1 5 use of such a construct to transfect target cells in the patient will result in the transcription 
of sufficient amounts of PTMs that will form complementary base pairs with the 
endogenously expressed pre-mRNA targets and thereby facilitate a /r^3w^-splicing 
reaction between the complexed nucleic acid molecules. For example, a vector can be 
introduced in vivo such that it is taken up by a cell and directs the transcription of the 

20 PTM molecule. Such a vector can remain episomal or become chromosomally 

integrated, as long as it can be transcribed to produce the desired RNA. Such vectors can 
be constructed by recombinant DNA technology methods standard in the art. 
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Vectors encoding the PTM of interest can be plasmid, viral, or others 
known in the art, used for replication and expression in mammalian cells. Expression of 
the sequence encoding the PTM can be regulated by any promoter known in the art to act 
in mammalian, preferably human cells. Such promoters can be inducible or constitutive. 
5 Such promoters include but are not limited to: the SV40 early promoter region (Benoist, 
C. and Chambon, P. 1981, Nature 290:304-310), the promoter contained in the 3* long 
terminal repeat of Rous sarcoma virus (Yamamoto et aL, 1980, Cell 22:787-797), the 
herpes thymidine kinase promoter (Wagner et aL, 1981, Proc. Natl. Acad. Sci. U.S.A. 
78:1441 1445), the regulatory sequences of the metallothionein gene (Brinster et qL, 1982, 

10 Nature 296:39-42), the viral CMV promoter, the human chorionic gonadotropin-P 

promoter (Hollenberg et al., 1994, Mol. CelL Endocrinology 106:1 1 1-1 19), etc. Any type 
of plasmid, cosmid, YAC or viral vector can be used to prepare the recombinant DNA 
construct which can be introduced directly into the tissue site. Alternatively, viral vectors 
can be used which selectively infect the desired target cell. 

1 5 For use of PTMs encoding peptide affinity purification tags, it is desirable 

to insert nucleotide sequences containing random target binding sites into the PTMs and 
clone them into a selectable mammalian expression vector system. A number of selection 
systems can be used, including but not limited to selection for expression of the herpes 
simplex virus thymidine kinase, hypoxanthine-guanine phosphoribosyltransterase and 

20 adenine phosphoribosyl tranferase protein in tk-, hgprt- or aprt- deficient cells, 

respectively. Also, anti-metabolic resistance can be used as the basis of selection for 
dihydrofolate tranferase (dhfr), which confers resistance to methotrexate; xanthine- 
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guanine phosphoribosyl transferase (gpt), which confers resistance to mycophenolic acid; 
neomycin (neo), which confers resistance to aminoglycoside G-418; and hygromycin B 
phosphotransferase (hygro) which confers resistance to hygromycin. In a preferred 
embodiment of the invention, the cell culture is transformed at a low ratio of vector to 
cell such that there will be only a single vector, or a limited number of vectors, present in 
any one cell. Vectors for use in the practice of the invention include any eukaryotic 
expression vectors, including but not limited to viral expression vectors such as those 
derived from the class of retroviruses or adeno-associated viruses. 

5.3. USES AND ADMINISTRATION OF TRANS-SPUCING MOLECULES 

5.3.1. USE OF PTM MOLECULES FOR GENE REGULATION, GENE 
REPAIR AND TARGETED CELL DEATH 

The compositions and methods of the present invention will have a variety 

of different applications including gene regulation, gene repair and targeted cell death. 

For example, trans-splicing can be used to introduce a protein with toxic properties into 

a cell. In addition, PTMs can be engineered to bind to viral mRNA and destroy the 

function of the viral mRNA, or alternatively, to destroy any cell expressing the viral 

mRNA. In yet another embodiment of the invention, PTMs can be engineered to place a 

stop codon in a deleterious mRNA transcript thereby decreasing the expression of that 

transcript. 

Targeted /rara-splicing, including double-/raw5-splicing reactions, can be 
used to repair or correct transcripts that are either truncated or contain point mutations. 
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The PTMs of the invention are designed to cleave a targeted transcript upstream or 
downstream of a specific mutation or upstream of a premature 3' and correct the mutant 
transcript via a /ran^-splicing reaction which replaces the portion of the transcript 
containing the mutation with a functional sequence. 
5 In addition, double trans -splicing reactions may be used for the selective 

expression of a toxin in tumor cells. For example, PTMs can be designed to replace the 
second exon of the human P-chronic gonadotropin-6 (phCG6) gene transcripts and to 
deliver an exon encoding the subunit A of diptheria toxin (DT-A). Expression of DT-A 
in the absence of subunit B should lead to toxicity only in the cells expressing the gene,! , 

10 phCG6 is a prototypical target for genetic modification by trans-splicing. The sequence 
and the structure of the phCG6 gene are completely kxiovra and the pattern of splicing has 
been determined. The phCG6 gene is highly expressed in many types of solid tumors, 
including many non-germ line tumors, but the phCG6 gene is silent in the majority cells 
in a normal adult. Therefore, the phCG6 pre-mRNA represents a desirable target for a 

1 5 trans-splicing reaction designed to produce tumor-specific toxicity . 

The first exon of phCG6 pre-mRNA is ideal in that it encodes only five 
amino acids, including the initiator AUG, which should result in minimal interference 
with the proper folding of the DT-A toxin while providing the required signals for 
effective translation of the trans-spliced mRNA. The DT-A exon, which is designed to 

20 include a stop codon to prevent chimeric protein formation, will be engineered to trans- 
splice into the last exon of the phCG6 gene. The last exon of the phCG6 gene provides 
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the construct with the appropriate signals to polyadenylate the mRNA and ensure 
translation. 

Cystic fibrosis (CF) is one of the most common fatal genetic disease in 
humans. Based on both genetic and molecular analyses, the gene associated with cystic 
5 fibrosis has been isolated and its protein product deduced (Kerem, B.S. et al., 1989, 
Science 245:1073-1080; Riordan et al., 1989, Science 245: 1066-1 073 ;Rommans, et al, 
1989, Science 245:1059-1065). The protein product of the CF associated gene is called 
the cystic fibrosis transmembrane conductance regulator (CFTR). In a specific 
embodiment of the invention, a trans-splicing reaction will be used to correct a genetic . 

1 0 defect in the DNA sequence encoding the cystic fibrosis transmembrane regulator 
(CFTR) whereby the DNA sequence encoding the cystic fibrosis trans-membrane 
regulator protein is expressed and a functional chloride ion channel is produced in the 
airway epithelial cells of a patient. 

Population studies have indicated that the most common cystic fibrosis 

15 mutation is a deletion of the three nucleotides in exon 10 that encode phenylalanine at 
position 508 of the CFTR amino acid sequence. As indicated in Figure 15, a trans- 
splicing reaction was capable of correcting the deletion at position 508 in the CFTR 
amino acid sequence. The PTM used for correction of the genetic defect contained a 
CFTR BD intron 9 sequence, a spacer sequence, a branch point, a polypyrimidine tract, a 

20 3' splice site and a wild type CFTR BD exon 10 sequence (Figure 13). The successful 
correction of the mutated DNA encoding CFTR utilizing a trans-splicing reaction 
supports the general application of PTMs for correction of genetic defects. 
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The methods and compositions of the invention may also be used to 
regulate gene expression in plants. For example, trans-splicing may be used to place the 
expression of any engineered gene under the natural regulation of a chosen target plant 
gene, thereby regulating the expression of the engineered gene. Trans-splicing may also 
5 be used to prevent the expression of engineered genes in non-host plants or to convert an 
endogenous gene product into a more desirable product. 

In a specific embodiment of the invention rraw-^plicing may be used to 
regulate the expression of the insecticidal gene that produces Bt toxin (Bacillus 
thuringiensis). For example, the PTM may be designed to trans-splice into an injury 

10 response gene (pre-mRNA) that is expressed only after an insect bites the plant. Thus, all 
cells of the plant would carry the gene for Bt in the PTM, but the cells would only 
produce Bt when and where an insect injures the plant. The rest of the plant will make 
little or no Bt. A PTM could trans-splice the Bt gene into any chosen gene with a desired 
pattern of expression. Further, it should be possible to target a PTM so that no Bt is 

1 5 produced in the edible portion of the plant. 

One advantage associated with the use of PTMs is that the PTM acquires 
the native gene control elements of the target gene, thus, reducing the time and effort that 
might othenvise be spent attempting to identify and reconstitute appropriate regulatory 
sequences upstream of an engineered gene. Thus, expression of the PTM regulated gene 

20 should occur only in those plant cells containing the target pre-mRNA. By targeting a 
gene not expressed in the edible portion of the plant or in the pollen, trans-splicing can 
alleviate opposition to genetically modified plants, as consumers would not be eating the 
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proteins made from modified genes. The edible portion of such crops should test 
negative for genetically modified proteins. 

In addition, PTM can be targeted to a imique sequence of the host gene 
that is not present in other plants. Therefore, even if the gene (DNA) which encodes the 
5 PTM jumps to another species of plant, the PTM gene will not have an appropriate target 
for ^aw5-splicing. Thus, ^ara-splicing offers a "fail-safe" mode for prevention of gene 
"jumping" to Other plant species: the PTM gene will be expressed only in the engineered 
host plant, which contains the appropriate target pre-mRNA. Expression in non- 
engineered plants would not be possible. 

10 7>a«5-splicing also provides a more efficient way to convert one gene 

product into another. For example, trans-splicing ribozymes and chimeric oligos can be 
incorporated into com genomes to modify the ratio of saturated to unsaturated oils. 
Traw^-splicing can also be used to convert one gene product into another. 

Various delivery systems are known and can be used to transfer the 

15 compositions of the invention into cells, e.g. encapsulation in liposomes, microparticles, 
microcapsules, recombinant cells capable of expressing the composition, receptor- 
mediated endocytosis (see, e.g., Wu and Wu, 1987, J. Biol. Chem. 262:4429-4432), 
construction of a nucleic acid as part of a retroviral or other vector, injection of DNA, 
electroporation, calcium phosphate mediated transfection, etc. 

20 The compositions and methods can be used to treat cancer and other 

serious viral infections, autoimmune disorders, and other pathological conditions in 

f 
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which the alteration or elimination of a specific cell type would be beneficial. 
Additionally, the compositions and methods may also be used to provide a gene encoding 
a functional biologically active molecule to cells of an individual with an inherited 
genetic disorder where expression of the missing or mutant gene product produces a 
5 normal phenotype. 

In a preferred embodiment, nucleic acids comprising a sequence encoding 
a PTM are administered to promote PTM function, by way of gene delivery and 
expression into a host cell. In this embodiment of the invention, the nucleic acid 
mediates an effect by promoting PTM production. Any of the methods for gene delivery 

1 0 into a host cell available in the art can be used according to the present invention. For 
general reviews of the methods of gene delivery see Strauss, M. and Barranger, J.A., 
1997, Concepts in Gene Therapy, by Walter de Gruyter & Co., Berlin; Goldspiel et ai, 
1993, Clinical Pharmacy 12:488-505; Wu and Wu, 1991, Biotherapy 3:87-95; 
Tolstoshev, 1993, Ann. Rev. Pharmacol. Toxicol. 33:573-596; Mulligan, 1993, Science 

15 260:926-932; and Morgan and Anderson, 1993, Ann. Rev. Biochem. 62:191-217; 1993, 
TIBTECH 1 1 (5): 1 55-2 15. Exemplary methods are described below. 

Delivery of the nucleic acid into a host cell may be either direct, in which 
case the host is directly exposed to the nucleic acid or nucleic acid-carrying vector, or 
indirect, in which case, host cells are first transformed with the nucleic acid in vitro, then 

20 transplanted into the host. These two approaches are known, respectively, as in vivo or ex 
vivo gene delivery. 
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In a specific embodiment, the nucleic acid is directly administered in v/vo, 
where it is expressed to produce the PTM. This can be accomplished by any of numerous 
methods known in the art, e.g., by constructing it as part of an appropriate nucleic acid 
expression vector and administering it so that it becomes intracellular, e.g. by infection 
5 using a defective or attenuated retroviral or other viral vector (see U.S. Patent No. 

4,980,286), or by direct injection of naked DNA, or by use of microparticle bombardment 
(e.g., a gene gun; Biolistic, Dupont), or coating with lipids or cell-surface receptors or 
transfecting agents, encapsulation in liposomes, microparticles, or microcapsules, or by 
administering it in linkage to a peptide which is known to enter the nucleus, by ; 

1 0 administering it in linkage to a ligand subject to receptor-mediated endocytosis (see e.g. , 
Wu and Wu, 1987, J. Biol. Chem. 262:4429-4432). 

In a specific embodiment, a viral vector that contains the PTM can be 
used. For example, a retroviral vector can be utilized that has been modified to delete 
retroviral sequences that are not necessary for packaging of the viral genome and 

15 integration into host cell DNA (see Miller et al, 1993, Meth. Enzymol. 217:581-599). 
Alternatively, adenoviral or adeno-associated viral vectors can be used for gene delivery 
to cells or tissues. (See, Kozarsky and Wilson, 1993, Current Opinion in Genetics and 
Development 3:499-503 for a review of adenovirus-based gene delivery). 

Another approach to gene delivery into a cell involves transferring a gene 

20 to cells in tissue culture by such methods as electroporation, lipofection, calcium 
phosphate mediated transfection, or viral infection. Usually, the method of transfer 
includes the transfer of a selectable marker to the cells. The cells are then placed under 
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selection to isolate those cells that have taken up and are expressing the transferred gene. 
The resulting recombinant cells can be delivered to a host by various methods knovm in 
the art. In a preferred embodiment, the cell used for gene delivery is autologous to the 
host cell. 

5 The present invention also provides for pharmaceutical compositions 

comprising an effective amount of a PTM or a nucleic acid encoding a PTM, and a 
pharmaceutically acceptable carrier. In a specific embodiment, the term 
"pharmaceutically acceptable" means approved by a regulatory agency of the Federal or a 
state government or listed in the U.S. Pharmacopeia or other generally recognized 

10 pharmacopeia for use in animals, and more particularly in humans. The term "carrier" 
refers to a diluent, adjuvant, excipient, or vehicle with which the therapeutic is 
administered. Examples of suitable pharmaceutical carriers are described in 
"Remington's Pharmaceutical sciences" by E.W. Martin. 

In specific embodiments, pharmaceutical compositions are administered: 

15 (1) in diseases or disorders involving an absence or decreased (relative to normal or 
desired) level of an endogenous protein or function, for example, in hosts where the 
protein is lacking, genetically defective, biologically inactive or underactive, or under 
expressed; or (2) in diseases or disorders wherein, in vitro or in vivo, assays indicate the 
utility of PTMs that inhibit the function of a particular protein . The activity of the 

20 protein encoded for by the chimeric mRNA resulting fi^om the PTM mediated trans- 
splicing reaction can be readily detected, e.g., by obtaining a host tissue sample (e.g., 
fi"om biopsy tissue) and assaying it in vitro for mRNA or protein levels, structure and/or 
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activity of the expressed chimeric mRNA. Many methods standard in the art can be thus 
employed, including but not limited to immunoassays to detect and/or visualize the 
protein encoded for by the chimeric mRNA (e.g., Western blot, immunoprecipitation 
followed by sodium dodecyl sulfate polyacrylamide gel electrophoresis, 
5 immunocytochemistry, etc.) and/or hybridization assays to detect formation of chimeric 
mRNA expression by detecting and/or visualizing the presence of chimeric mRNA (e.g., 
Northem assays, dot blots, in situ hybridization, and Reverse-Transcription PGR, etc.), 
etc. 

The present invention also provides for pharmaceutical compositions 
10 comprising an effective amount of a PTM or a nucleic acid encoding a PTM, and a 
pharmaceutically acceptable carrier. In a specific embodiment, the term 
"pharmaceutically acceptable" means approved by a regulatory agency of the Federal or a 
state government or listed in the U.S. Pharmacopeia or other generally recognized 
pharmacopeia for use in animals, and more particularly in humans. The term "carrier" 
1 5 refers to a diluent, adjuvant, excipient, or vehicle with which the therapeutic is 
administered. Examples of suitable pharmaceutical carriers are described in 
"Remington's Pharmaceutical sciences" by E.W. Martin. In a specific embodiment, it may 
be desirable to administer the pharmaceutical compositions of the invention locally to the 
area in need of treatment. This may be achieved by, for example, and not by way of 
20 limitation, local infusion during surgery, topical application, e.g., in conjunction with a 
woimd dressing after surgery, by injection, by means of a catheter, by means of a 
suppository, or by means of an implant, said implant being of a porous, non-porous, or 
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gelatinous material, including membranes, such as sialastic membranes, or fibers. Other 
control release drug delivery systems, such as nanoparticles, matrices such as controUed- 
release polymers, hydrogels. 

The PTM will be administered in amounts which are effective to produce 
the desired effect in the targeted cell. Effective dosages of the PTMs can be determined 
through procedures well known to those in the art which address such parameters as 
biological half-life, bioavailability and toxicity. The amount of the composition of the 
invention which will be effective will depend on the nature of the disease or disorder 
being treated, and can be determined by standard clinical techniques. In addition, in vitip^ 
assays may optionally be employed to help identify optimal dosage ranges. 

The present invention also provides a pharmaceutical pack or kit 
comprising one or more containers filled with one or more of the ingredients of the 
pharmaceutical compositions of the invention optionally associated with such 
container(s) can be a notice in the form prescribed by a governmental agency regulating 
the manufacture, use or sale of pharmaceuticals or biological products, which notice 
reflects approval by the agency of manufacture, use or sale for human administration. 

5.3.2. USE OF PTM MOLECULES FOR EXON TAGGING 
In view of current efforts to sequence and characterize the genomes of 
humans and other organisms, there is a need for methods that facilitate such 
characterization. A majority of the information currently obtained by genomic mapping 
and sequencing is derived from complementary DNA (cDNA) libraries, which are made 
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by reverse transcription of mRNA into cDNA. Unfortunately, this process causes the loss 
of information concerning intron sequences and the location of exon/intron boundaries. 

The present invention encompasses a method for mapping exon-intron 
boundaries in pre-mRNA molecules comprising (i) contacting a pre-trans-splicing 
5 molecule with a pre-mRNA molecule under conditions in which a portion of the pre- 
trans-splicing molecule is trans-spUced to a portion of the target pre-mRNA to form a 
chimeric mRNA; (ii) amplifying the chimeric mRNA molecule; (iii) selectively 
purifying the amplified molecule; and (iv) determining the nucleotide sequence of the 
amplified molecule thereby identifying the intron-exon boundaries. 

10 In an embodiment of the present invention, PTMs can be used in trans- 

splicing reactions to locate exon-intron boundaries in pre-mRNAs molecules. PTMs for 
use in mapping of intron-exon boundaries have stmctures similar to those described 
above in Section 5.1. Specifically, the PTMs contain (i) a target binding domain that is 
designed to bind to many pre-mRNAs: (ii) a 3' splice region that includes a branch point, 

15 pyrimidine tract and a 3* splice acceptor site, or a 5' splice donor site; (iii) a spacer region 
that separates the mRNA splice site from the target binding domain; and (iv) a tag region 
that will be trans-spliced onto a pre-mRNA. Alternatively, the PTMs to be used to locate 
exon-intron boundaries may be engineered to contain no target binding domain. 

For purposes of intron-exon mapping, the PTMs are genetically 

20 engineered to contain target binding domains comprising random nucleotide sequences. 
The random nucleotide sequences contain at least 15-30 and up to several hundred 
nucleotide sequences capable of binding and anchoring a pre-mRNA so that the 
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spliceosome processing machinery of the nucleus can trans-splice a portion (tag or 
marker region) of the PTM to a portion of the pre-mRNA. PTMs containing short target 
binding domains, or containing inosines bind under less stringent conditions to the pre^ 
mRNA molecules. In addition, strong branch point sequences and pyrimidine tracts serve 
5 to increase the non-specificity of PTM trans-spUcing. 

The random nucleotide sequences used as target binding domains in the 
PTM molecules can be generated using a variety of different methods, including, but not 
limited to, partial digestion of DNA with restriction endonucleases or mechanical 
shearing of the DNA. The use of such random nucleotide sequences is designed to 

10 generate a vast array of PTM molecules with different binding activities for each target 
pre-mRNA expressed in a cell. Randomized libraries of oligonucleotides can be 
synthesized with appropriate restriction endonucleases recognition sites on each end for 
cloning into PTM molecules genetically engineered into plasmid vectors. When the 
randomized oligonucleotides are litigated and expressed, a randomized binding library of 

1 5 PTMs is generated. 

In a specific embodiment of the invention, an expression library encoding 
PTM molecules containing target binding domains comprising random nucleotide 
sequences can be generated using a variety of methods which are well known to those of 
skill in the art. Ideally, the library is complex enough to contain PTM molecules capable 

20 of interacting with each target pre-mRNA expressed in a cell. 

By way of example. Figure 9 is a schematic representation of two forms of 
PTMs which can be utilized to map intron-exon boundaries. The PTM on the left is 
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capable of non-specifically trans-splicing into a pre-mRNA 3' splice site, while the PTM 
on the right is capable of trans-splicing into a pre-mRNA 5' splice site. Trans-splicing 
between the PTM and the target pre-mRNA results in the production of a chimeric 
mRNA molecule having a specific nucleotide sequence "tag" on either the 3' or 5' end of 
5 an authentic exon. 

Following selective purification, a DNA sequencing reaction is then 
performed using a primer which begins in the tag nucleotide sequence of the PTM and 
proceeds into the sequence of the tagged exon. The sequence immediately following the 
last nucleotide of the tag nucleotide sequence represents an exon boundary. For 
1 0 identification of intron-exon tags, the trans-splicing reactions of the invention can be 

performed either in vitro or in vivo using methods well known to those of skill in the art. 



5.3.3. USE OF PTM MOLECULES FOR IDENTIFICATION OF 
PROTEINS EXPRESSED IN A CELL 

In yet another embodiment of the invention, PTM mediated trans-splicing 

1 5 reactions can be used to identify previously undetected and unknown proteins expressed 

in a cell. This method is especially useful for identification of proteins that cannot be 

detected by a two-dimensional electrophoresis, or by other methods, due to inter alia the 

small size of the protein, low concentration of the protein, or failure to detect the protein 

due to similar migration patterns with other proteins in two-dimensional electrophoresis. 

20 The present invention relates to a method for identifying proteins 

expressed in a cell comprising (i) contacting a pre-trans-splicing molecule containing a 
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random target binding domain and a nucleotide sequence encoding a peptide tag with a 
pre-mRNA molecule under conditions in which a portion of the pre-trans-splicing 
molecule is trans-spliced to a portion of the target pre-mRNA to form a chimeric mRNA 
encoding a fusion polypeptide or separating it by gel electrophoresis (ii) affinity purifying 
5 the fusion polypeptide; and (iii) determining the amino acid sequence of the fusion 
protein. 

To identify proteins expressed in a cell, the PTMs of the invention are 
genetically engineered to contain: (i) a target binding domain comprising randomized 
nucleotide sequences; (ii) a 3* splice region that includes a branch point, pyrimidine tract 

10 and a 3' splice acceptor site and/or a 5' splice donor site; (iii) a spacer region that 
separates the PTM splice site firom the target binding domain; and (iv) nucleotide 
sequences encoding a marker or peptide affinity purification tag. Such peptide tags 
include, but are not limited to, HIS tags (6 histidine consecutive residues) (JanJcnecht, 
et al., 1991 Proc. Natl. Acad. Sci. USA 88:8972-8976), glutathione-S-transferase (GST) 

15 (Smith, D.B. and Johnson K.S., 1988, Gene 67:31) (Pharmacia) or FLAG (Kodak/IBI) 
tags (Nisson, J. et al. J. Mol. Recognit., 1996, 5:585-594) 

Trans-splicing reactions using such PTMs results in the generation of 
chimeric mRNA molecules encoding fusion proteins comprising protein sequences 
normally expressed in a cell linked to a marker or peptide affinity purification tag. The 

20 desired goal of such a method is that every protein synthesized in a cell receives a marker 
or peptide affinity tag thereby providing a method for identifying each protein expressed 
in a cell. 
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In a specific embodiment of the invention, PTM expression libraries 
encoding PTMs having different target binding domains comprising random nucleotide 
sequences are generated. The desired goal is to create a PTM expression library that is 
complex enough to produce a PTM capable of binding to each pre-mRNA expressed in a 
5 cell. In a preferred embodiment, the library is cloned into a manmialian expression vector 
that results in one, or at most, a few vectors being present in any one cell. 

To identify the expression of chimeric proteins, host cells are transformed 
with the PTM library and plated so that individual colonies containing one PTM vector 
can be grown and purified. Single colonies are selected, isolated, and propagated in the 

10 appropriate media and the labeled chimeric protein exon(s) fragments are separated away 
from other cellular proteins using, for example, an affinity purification tag. For example, 
affinity chromatography can involve the use of antibodies that specifically bind to a 
peptide tag such as the FLAG tag. Alternatively, when utilizing HIS tags, the fiision 
proteins are purified using a Ni^^ nitriloacetic acid agarose columns, which allows 

15 selective elution of bound peptide eluted with imidazole containing buffers. When using 
GST tags, the ftision proteins are purified using glutathione-S-transferase agarose beads. 
The fiision proteins can then be eluted in the presence of free glutathione. 

Following purification of the chimeric protein, an analysis is carried out 
to determine the amino acid sequence of the fiision protein. The amino acid sequence of 

20 the fiision protein is determined using techniques well known to those of skill in the art, 
such as Edman Degradation followed by amino acid analysis using HPLC, mass 
spectrometry or an amino acid analyzation. Once identified, the peptide sequence is 
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compared to those sequences available in protein databases, such as GenBank. If the 
partial peptide sequence is already known, no further analysis is done. If the partial 
protein sequence is unknown, then a more complete sequence of that protein can be 
carried out to determine the full protein sequence. Since the fusion protein will contain 
5 only a portion of the full length protein, a nucleic acid encoding the full length protein 
can be isolated using conventional methods. For example, based on the partial protein 
sequence oligonucleotide primers can be generated for use as probes or PGR primers to 
screen a cDNA library. 

6. EXAMPLE: PRODUGTION OF TRANS-S?UCIMG MOLEGULES 
10 The following section describes the production of PTMs and the 

demonstration that such molecules are capable of mediating ^a«5-splicing reactions 
resulting in the production of chimeric mRNA molecules. 

6.1. MATERIALS AND METHODS 
6.1.1. GONSTRUGTION OF PRE-mRNA MOLEGULES 
1 5 Plasmids containing the wild type diphtheria toxin subunit A (DT-A, wild- 

type accession #K01722) and a DT-A mutant (GRM 197, no enzymatic activity) were 
obtained fi^om Dr. Virginia Johnson, Food and Drug Administration, Bethesda, Maryland 
(Uchida et al, 1973 J. Biol. Ghem 248:3838). For in vitro experiments, DT-A was 
amplified using primers: DT-IF (5'-GGGGGTGGAGGGGGGTGATGATGTTGTTG); 
20 and DT-2R (5'-GGCGAAG CTTGGATCCGACACGATTTCCTGCACAGG), cut with 
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PstI and Hindlll, and cloned into PstI and Hindlll digested pBS(-) vector (Stratagene, 
La JoUa, CA). The resulting clone, pDTA was used to construct the individual PTMs. 
(1) pPTMH-: Targeted construct. Created by inserting IN3-1 (5*AATTCTCTAGATGCTT 
CACCCGGGCCTGACTCGAGTACTAACTGGTACCTCTTCTTTTTTTTCCTGCA) 

- 5 and IN2-4 (5'-GGAAAAAAAAGAAGAGGTACCAGTTAGTACTCGAGTCAGG 
CCCGGGTGAAGCATCTAGAG) primers into EcoRI and PstI digested pDTA. (2) 
pPTM+Sp: As pPTMH- but with a 30 bp spacer sequence between the BD and BP. 
Created by digesting pPTM+ with Xhol and ligating in the oligonucleotides, spacer S (5*- 
TCGAGCAACGTTATAATAATGTTC) and spacer AS (5'-TCGAGAACATTATT 

10 ATAACGTTGC). For in vivo studies, an EcoRI and Hindlll fragment of pcPTM-i-Sp was 
cloned into mammalian expression vector pcDNA3.1 (Invitrogen), under the control of a 
CMV promoter. Also, the methionine at codon 14 was changed into isoleucine to prevent 
initiation of translation. The resulting pla:smid was designated as pcPTM+Sp. (3) 
pPTM+CRM: As pPTM+Sp but the wild type DT-A was substituted with CRM mutant 

15 DT-A (T. Uchida, et al., 1973, J. Biol. Chem. 248:3838). This was created by PCR 
amplification of a DT-A mutant (mutation at G52E) using primers DT-IF and DT-2R. 
For in vivo studies, an EcoRI Hindlll fragment of PTM+CRM was cloned into pc3.1DNA 
that resulted in pcPTM+ARM. (4) PTM-: Non-targeted construct. Created by digestion 
of PTM+ with EcoRI and Pst I, gel purified to remove the binding domain followed by 

20 ligation of the oligonucleotides, IN-5 (5'-ATCTCTAGATCAGGCCCGGGTGAAGCC 
CGAG) and IN-6 (5'-TGCTTCACCC GGGCCTGATCTAGAG). (5) PTM-Sp, is an 
identical version of the PTM-, except it has a 30 bp spacer sequence at the PstI site. 
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Similarly, the splice mutants [Py(-)AG(-) and BP(-)Py(-)AG(-)] and safety variants 
[PTM+SF-Pyl, PTM+SF-Py2, PTM+SFBP3 and PTM+SFBP3-Pyl] were constructed 
either by insertion or deletion of specific sequences (see Table 1). 



Table 1. Binding/non-binding domain^ BP, PPT and 3* as sequences of different PTMs. 


PTM construct 


BD/NBD 


BP 


PPT 


3'ss 


PTM+Sp (targeted) 


:TGCTTCACCCGGGCCTGA 


TACTAAC 


CTCTTCTTTTTTTTCC 


CAG 


PTM-Sp (non-targeted) 


:CAACGTTATAATAATGTT 


TACTAAC 


CTCTTCTTTTTTTTCC 


CAG 


PTM+Py (.)AG(-)BP(-) 


:TGCTTCACCCGGGCCTGA 


GGCTGAT 


CTGTGATTAATAGCGG 


ACG 


PTM+Py(-)AG(-) 


:TGCTTCACCCGGGCCTGA 


TACTAAC 


CCTGGACGCGGAAGTT 


ACG 


PTM+SF 


:CTGGGACAAGGACACTGCTT 
CACCCGGTTAGTAGACCACA 
GCCCTGAAGCC 


TACTAAC 


CTTCTGTTTTmCTC 


CAG 


PTM+SF-Py 


:As in PTM+SF 


TACTAAC 


CTTCTGTATTATTCTC 


CAG 


PTM+SF-Py 


:As in PTM+SF 


TACTAAC 


GTTCTGTCCTTGTCTC 


CAG 


PTM+SF-BP3 


:As in PTM+SF 


TGCTGAC 


CTTCTGTTTTTTTCTC 


CAG 


PTM+SFBP3-Py 


:As in PTM+SF 


TGCTGAC 


CTTCTGTATTATTCTC 


CAG 



15 Nucleotides in bold indicate the mutations compared to normal BP, PPT and 3' splice site, k 

Branch site A is underlined. The nucleotides in italics indicates the mismatch introduced 
into safety BD to mask the BP sequence in the PTM. 

A double-/rara-splicing PTM construct (DS-PTM) was also made adding 

a 5* splice site and a second target binding domain complementary to the second intron of 

20 pHCG pre-mRNA to the 3' end of the toxin coding sequence of PTM+SF (Figure A). 

6.1.2. pHCG6 TARGET PRE-mRNA 
To produce the in vitro target pre-mRNA, a Sad fragment of pHCG gene 
6 (accession #X00266) was cloned into pBS(-). This produced an 805 bp insert from 
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nucleotide 460 to 1265, which includes the 5* untranslated region, initiation codon, 
exon 1, intron 1, exon 2, and most of intron 2. For in vivo studies, an EcoRI and BamHI 
fragment was cloned into mammalian expression vector (pc3.1DNA), producing pHCG6. 

6.1.3. mRNA PREPARATION 
5 For in vitro splicing experiments, pHCG6, P-globin pre-mRNA and 

different PTM mRNAs were synthesized by in vitro transcription of BamHI and Hindlll 
digested plasmid DNAs respectively, using T7 mRNA polymerase (Pasman & Garcia- 
Blanco, 1996, Nucleic Acids Res. 24:1638). Synthesized mRNAs were purified by 
electrophoresis on a denaturing polyacrylamide gel, and the products were excised and 
10 eluted. 

6.1.4 IN VITRO SPLICING 
PTMs and target pre-mRNA were annealed by heating at 98°C followed 
by slow cooling to 30-34 °C. Each reaction contained 4 |il of annealed mRN A complex 
(100 ng of target and 200 ng of PTM), IX splice buffer (2 mM MgCl2, 1 mM ATP, 5 

15 mM creatinine phosphate, and 40 mM KCI) and 4 \i\ of HeLa splice nuclear extract 

(Promega) in a 12.5 |il final volume. Reactions were incubated at 30°C for the indicated 
times and stopped by the addition of an equal volume of high salt buffer (7 M urea, 5% 
SDS, 100 mM LiCl, 10 mM EDTA and 10 mM TrisHCI, pH 7.5). Nucleic acids were 
purified by extraction with phenol:chloroform:isoamyl alcohol (50:49:1) followed by 

20 ethanol precipitation. 
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6.1.5. REVERSE TRANSCRIPTION-PCR REACTIONS 

RT-PCR analysis was performed using EZ-RT PGR kit (Perkin-Elmer, 

Foster City, CA). Each reaction contained 10 ng of cis- or ^raw^-spliced mRNA, or 

1-2 ^g of total mRNA, 0.1 \il of each 3' and 5' specific primer, 0.3 mM of each dNTP, IX 
5 EZ buffer (50 mM bicine, 1 1 5 mM potassium acetate, 4% glycerol pH 8.2), 2.5 mM 

magnesium acetate and 5 U of rTth DNA polymerase in a 50 nl reaction volume. 

Reverse transcription was performed at 60°C for 45 min followed by PCR amplification 

of the resulting cDNA as follows: one cycle of initial denaturation at 94°C for 30 sec, and 

25 cycles of denaturation at 94°C for 18 sec and aimealing and extension at 60°C for 
10 40 sec, followed by a 7 min final extension at 70°C. Reaction products were separated by 

electrophoresis in agarose gels. 

Primers used in the study were as follows: 

DT- 1 F: GGCGCTGCAGGGCGCTGATGATGTTGTTG 

DT-2R: GGCGAAGCTTGGATCCGACACGATTTCCTGCACAGG 
15 DT-3R: CATCGTCATAATTTCCTTGTG 

DT-4R: ATGGAATCTACATAACCAGG 

DT-5R: GAAGGCTGAGCACTACACGC 

HCG-R2: CGGCACCGTGGCCGAAGTGG, 

Bio-HCG-F: ACCGGAATTCATGAAGCCAGGTACACCAGG 
20 p-globulin-F: GGGCAAGGTGAACGTGGATG 

P-globulin-R: ATCAGGAGTGGACAGATCC 
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6.1.6. CELL GROWTH. TRANSFECTION AND mRN A ISOLATION 

Human lung cancer cell line HI 299 (ATCC accession # CRL-5803) was 
grown in RPMI medium supplemented with 10% fetal bovine serum at 37°C in a 5% CO2 
environment. Cells were transfected with pcSp+CRM (CRM is a non-functional toxin), a 
5 vector expressing a PTM, or vector alone (pcDNA3. 1) using lipofectamine reagent (Life 
Technologies, Gaithersburg, MD). The assay was scored for neomycin resistance (neo') 
colony formation two weeks after transfection. Four neo' colonies were selected and 
expanded under continued neo selection. Total cellular mRNA was isolated using RNA 
exol (BioChain Institute, Inc., San Leandro, CA) and used for RT-PCR. 

10 6.1.7. rig^A^^-SPLICING IN TUMORS IN NUDE MICE 

Eleven nude mice were bilaterally injected (except BIO, B 11 and B12 had 
1 tumor) into the dorsal flank subcutaneous space with 1x10^ HI 299 human lung tumor 
cells (day 1). On day 14, the mice were given an appropriate dose of anesthesia and 
injected with, or without electroporation (T820, BTX Inc., San Diego, CA) in several 

15 orientations with a total volume of 100 /A of saline containing 100 /xg pcSp+CRM with 
or without pcpHCG6 or pcPTM+Sp. Solutions injected into the right side tumors also 
contained India ink to mark needle tracks. The animals were sacrificed 48 hours later and 
the tumor excised and immediately frozen at -80 °C. For analysis, 10 mg of each tumor 
was homogenized and mRNA was isolated using a Dynabeads mRNA direct kit (Dynal) 

20 following the manufacturers directions. Purified mRNA (2 //I of 10 fxl total volume) was 
subjected to RT-PCR using pHCG-F and DT-5R primers as described earlier. All 
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samples were re-amplified using DT-SR, a nested DT-A primer and biotinylated pHCG-F 
and the products were analyzed by electrophoresis on a 2% agarose gel. Samples that 
produced a band were processed into single stranded DNA using M280 Streptavidin 
Dynabeads and sequenced using a toxin specific primer (DT-3R). 



5 6.2- RESULTS 

6.2.1. SYNTHESIS OF PTM 
A prototypical /rara-splicing mRNA molecule, pcPTM+Sp (Figure 1 A) 
was constructed that included: an 18 nt target binding domain (complementary to pHCG6 
intron 1), a 30 nucleotide spacer region, branch point (BP) sequence, a polypyrimidine 

10 tract (PPT) and an AG dinucleotide at the 3' splice site immediately upstream of an exon 
encoding diphtheria toxin subunit A (DT-A) (Uchida et al, 1973, J. Biol Chem. 
248:3838). Later DT-A exons were modified to eliminate translation initiation sites at 
codon 14. The PTM constructs were designed for maximal activity in order to 
demonstrate rraw^-splicing; therefore, they included potent 3' splice elements (yeast BP 

15 and a mammalian PPT) (Moore et a/., 1993, In The mRNA Worid, R.F. Gesteland and 
J.F. Atkins, eds. (Cold Spring Harbor, New York: Cold Spring Harbor Laboratory Press). 
pHCG6 pre-mRNA (Talmadge et al, 1984, Nucleic Acids Res. 12:8415) was chosen as a 
model target as this gene is expressed in most tumor cells. It is not expressed in normal 
adult cells, with the exception of some in the pituitary gland and gonads. (Acevedo et al,^ 

20 1992, Cancer 76: 1467; Hoon et a/., 1996, Int J. Cancer 69:369; Bellet et a/., 1997, Cancer 
Res. 57:516). As shown in Figure IC, pcPTM+Sp forms conventional Watson-Crick 
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base pairs by its binding domain with the 3' end of pHCG6 intron 1, masking the intronic 
3' splice signals of the target. This feature is designed to facilitate /ra«5-splicing between 
the target and the PTM. 

HeLa nuclear extracts were used in conjunction with established splicing 
5 procedures (Pasman & Garcia-Blanco, 1996, Nucleic Acids Res. 24:1638) to test if a 
PTM construct could invade the PHCG6 pre-mRNA target. The products of in vitro 
/raw^-splicing were detected by RT-PCR, using primers specific for chimeric mRNA 
molecules. The predicted product of a successful /ran^-splicing reaction is a chimeric 
mRNA comprising the first exon of pHCG6, followed immediately by the exon 
1 0 contributed from pcPTM+Sp encoding DT-A (Figure 1 C). Such chimeric mRNAs were 
readily detected by RT-PCR using primers pHCG-F (specific to pHCG6 exon 1) and DT- 
3R (specific to DT-A, Figure 2A, lanes 1-2). At time zero or in the absence of ATP, no 
466 bp product was observed, indicating that this reaction was both ATP and time 
dependent. 

1 5 The target binding domain of pcPTM+Sp contained 1 8 nucleotides 

complementary to PHCG6 intron 1 pre-mRNA and demonstrated efficient /raw^-splicing 
(Figure 2A, lanes 1-2). 7>aw5-splicing efficiency decreased at least 8 fold (Figure 2, 
lanes 3-4) using non-targeted PTM-Sp, which contains a non-complementary 18 
nucleotide "non-binding domain". Trara-splicing efficiencies of PTM mRNAs with or 

20 without a spacer between the binding domain and BP were also compared. This 

experiment demonstrated a significant increase in the efficiency of ^a/?5-splicing by the 
addition of a spacer (Figure 2B, lanes 2 + 5). To facilitate the recruitment of splicing 
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factors required for efficient tranS'Splicing, some space may be needed between the 3' 
splice site and the double-stranded secondary structure produced by the binding 
domain/target interaction. 

To investigate the effect of PTM length on /raw^'-splicing specificity, 
5 shorter PTMs were synthesized fi^om AccI cut PTM plasmid (see Figure I). This 

eliminated 479 nt from the 3' end of the DT-A coding sequence. Figure 2B shows the 
trans-splicing ability of a targeted short PTM(+) (lanes 10-12), compared to a non- 
targeted short PTM(-) (lanes 14-17). Short PTM+ produced substantially more trans- 
spliced product (Figure 2B, lane 12) than its counterpart, non-targeted shorl PTM .(Figure 
10 2B, lane 17). These experiments indicate that longer PTMs may have increased potential 
to mediate /ra«5'-splicing non-specifically. 



6.2.2. ACCURACY OF PTM SPLICEOSOME MEDIATED 
TRANS-SFLICJNG 

To confirm that trans-splicing between the pcPTM+Sp and pHCG6 target 

1 5 is precise, RT-PCR amplified product was produced using 5' biotinylated pHCG-F and 

nonbiotinylated DT-3R primers. This product was converted into single stranded DNA 

and sequenced directly with primer DT-3R (DT-A specific reverse primer) using the 

method of Mitchell and Merril (1989, Anal. Biochem. 178:239). Trans-splicing occurred 

exactly between the predicted splice sites (Figure 3), confirming that a conventional pre- 

20 mRNA can be invaded by an engineered PTM construct during splicing; moreover, this 

reaction is precise. 
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In addition selective /rara-splicing of a double splicing PTM (DS-PTM) 
was observed (Figure 8B). The DS-PTM can produce /ran^-splicing by contributing 
either a 3' or 5' splice site. Further, DS-PTMs can be constructed which will be capable 
of simultaneously double-/ram-splicing, at both a 3' and 5' site, thereby permitting exon 
5 replacement. Figure SB demonstrates that in this construct the 5' splice site is most active 
at a 1 : 1 concentration of target pHCG pre-mRNA:DS-PTM. At a 1 :6 ratio the 3' splice 
site is more active. 

6.2.3. 3* SPLICE SITES ARE ESSENTIAL FOR PTM ^TV^^-SPLICING 
In general, the 3' splice site contains three elements: 1) a BP sequence 

10 located 5' of the acceptor site, 2) a PPT consisting of a short run of pyrimidine residues, 
and 3) a YAG trinucleotide splice site acceptor at the intron-exon border (Senapathy et 
al, 1990, Cell 91 :875; Moore et a/., 1993). Deletion or alteration of one of these 
sequence elements are knovra to either decrease or abolish splicing (Aebi et aL, 1986; 
Reed & Maniatis 1988, Genes Dev. 2:1268; Reed, 1989, Genes Dev. 3:21 13; Roscigno et 

15 a/., 1993, J. Biol. Chem. 268:1 1222; Coolidge et al, 1997, Nucleic Acids Res. 25:888). 
The role of these conserved elements in targeted /rara-splicing was addressed 
experimentally. In one case [(BP(-)Py(-)AG(-)], all three cis elements (BP, PPT and 
AG dinucleotide) were replaced by random sequences. A second splicing mutant [(Py(- 
)AG(-)] was constructed in which the PPT and the 3' splice site acceptor were mutated 

20 and substituted by random sequences. Neither construct was able to support trans- 

splicing in vitro (Figure 2A, lanes 5-8), suggesting that, as in the case of conventional cis- 

NY02:301604.1 -56- 



k51304B-A-A 072874.0134 



splicing, the PTM trans-splicing process also requires a functional BP, PPT and AG 
acceptor at the 3' splice site. 

6.2.4. DEVELOPMENT OF A "SAFETY" SPLICE SITE 
TO INCREASE SPECIFICITY 

To improve the levels of target specificity achieved by the inclusion of a 

binding domain or by shortening the PTM, the target-binding domain of several PTM 

constructs v^as modified to create an intra-molecular stem to mask the 3' splice site 

(termed a "safety PTM"). The safety stem is formed by portions of the binding domain 

that partially base pair with regions of the PTM 3' splice site or sequences adjacent to 

them, thereby blocking the access of spliceosomal components to the PTM 3* splice site 

prior to target acquisition (Figure 4A, PTM+SF). Base pairing between free portions of 

the PTM binding domain and pHCG6 target region unwinds the safety stem, allov^ng 

splicing factors such as U2AF to bind to the PTM 3* splice site and initiate /ra«^-splicing. 

(Figure 4B). 

This concept was tested in splicing reactions containing either PTM+SF 
(safety) or pcPTM+Sp (linear), and both target (pHCG6) and non-target (p-globin) pre- 
mRNA. The spliced products were subsequently analyzed by RT-PCR and gel 
electrophoresis. Using pHCG-F and DT-3R primers, the specific 196 bp trans-spliced 
band was demonstrated in reactions containing pHCG target and either linear PTM 
(pcPTM+Sp, Figure 5, lane 2) or safety PTM (PTM+SF, Figure 5, lane 8). Comparison 
of the targeted trans-splicing between linear PTM (Figure 5, lane 2) and safety PTM 
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(Figure 5, lane 8) demonstrated that the safety PTM trans-splicQd less efficiently than the 
linear PTM. 

Non-targeted reactions were amplified using p-globin-F (specific to 
exon 1 of p-globin) and DT-3R primers. The predicted product generated by non-specific 
5 PTM trans-splicing with p-globin pre-mRNA is 1 89 bp. Non-specific /raw^-splicing was 
evident between Unear PTM and P-globin pre-mRNA (Figure 5, lane 5). In contrast, non- 
specific /ram-splicing was virtually eliminated by the use of safety PTM (Figure 5, 
lane 1 1). This was not unexpected, since the linear PTM was designed for maximal 
activity to prove the concept of spliceosome-mediated /raw^'-splicing. The open structure 

10 of the linear PTM combined with its potent 3' splice sites strongly promotes the binding 
of splicing factors. Once bound, these splicing factors can potentially initiate trans- 
splicing with any 5' splice site, in a process similar to /r<3f«^-splicing in trypanosomes. 
The safety stem was designed to prevent splicing factors, such as U2AF fi-om binding to 
the PTM prior to target acquisition. This result is consistent with a model that base- 

1 5 pairing between the fi-ee portion of the binding domain and the pHCG6 target unwdnds 
the safety stem (by mRNA-mRNA interaction), uncovering the 3' splice site, permitting 
the recruitment of splicing factors and initiation of ^raw^-splicing. No /r^jfw^-splicing was 
detected between p-globin and pHCG6 pre-mRNAs (Figure 5, lanes 3, 6, 9 and 12). 

6.2.5. IN VITRO 77g^A^5'-SPLICING OF SAFETY PTM AND VARIANTS 
20 To better understand the role of c/^-elements at the 3* splice site in trans- 

splicing a series of safety PTM variants were constructed in which either the PPT was 
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weakened by substitution with purines and/or the BP was modified by base substitution 
(see Table I). In vitro /ra«5-splicing efficiency of the safety (PTM+SF) was compared to 
three safety variants, which demonstrated a decreased abiUty to trans-s^Wct, The greatest 
effect was observed with variant 2 (PTM+SFPy2), which was /raw^-spUcing incompetent 
5 (Figure 40, lanes 5-6). This inhibition of /raw.v-splicing may be attributed to a weakened 
PPT and/or the higher T^, of the safety stem. In contrast, variations in the BP sequence 
(PTM+SFBP3) did not markedly effect /ran^-splicing (Figure 4C, lanes 7-8). This was 
not surprising since the modifications introduced were within the mammalian branch 
point consensus range YNYURAC (where Y = pyrimidine, R = purine and N = any v,.. 

10 nucleotide) (Moore et aL, 1993). This finding indicates that the branch point sequence 
can be removed without affecting splicing efficiency. Alterations in the PPT (PTM+SF- 
Pyl) decreased the level of trans-splicing (lanes 3-4). Similarly, when both BP and PPT 
were altered PTM+SFBP3-Pyl, they caused a further reduction in /raw^-splicing 
(Figure 4C, lanes 9-10). The order of trans-splicing efficiency of these safety variants is 

1 5 PTM+SF>PTM+SFBP3> PTM+SFPyl>PTM+SFBP3-Pyl>PTM+SFPy2. These results 
confirm that both the PPT and BP are important for efficient in vitro /ra«5-splicing 
(Roscigno etal., 1993, J. Biol. Chem. 268:11222). 



6.2.6. COMPETITION BETWEEN CIS- AND TRANS- SPLICING 

To determine if it was possible to block pre-mRNA c/5-splicing by 
20 increasing concentrations of PTM, experiments were performed to drive the reaction 
towards trans-splicing. Splicing reactions were conducted with a constant amount of 
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pHCG6 pre-mRNA target and various concentrations of /ran^-splicing PTM. Cis- 
splicing was monitored by RT-PCR using primers to pHCG-F (exon 1) and pHCG-R2 
(exon 2). This amplified the expected 125 bp m-spliced and 478 bp unspliced products 
(Figure 6A). The primers pHCG-F and DT-3R were used to detect trans-spMccd products 
(Figure 6B). At lower concentrations of PTM, c/>splicing (Fig. 6 A, lanes 1-4) 
predominated over ^raw^-splicing (Figure 6B, lanes 1-4). Cz5-splicing was reduced 
approximately by 50% at a PTM concentration L5 fold greater than target. Increasing the 
PTM mRNA concentration to 3 fold that of target inhibited c/.y-splicing by more than 
90% (Figure 6A, lanes 7-9), with a concomitant increase in the trans-splicGd product . 
(Figure 6B, lanes 6-10). A competitive RT-PCR was performed to simultaneously 
amplify both cis and /raw^-spliced products by including all three primers (pHCG~F, 
HCG-R2 and DT-3R) in a single reaction. This experiment had similar results to those 
seen in Figure 6, demonstrating that under in vitro conditions, a PTM can effectively 
block target pre-mRNA c/^-splicing and replace it with the production of an engineered 
trans-spliced chimeric mRNA. 



6.2.7. TRANS-SPLICING IN TISSUE CULTURE 
To demonstrate the mechanism of trans-splicing in a cell culture model, 
the human lung cancer line HI 299 (pHCG6 positive) was transfected with a vector 
expressing SP+CRM (a non-functional diphtheria toxin) or vector alone (pcDNA3.1) and 
grovm in the presence of neomycin. Four neomycin resistant colonies were individually 
collected after 14 days and expanded in the continued presence of neomycin. Total 
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mRNA was isolated from each clone and analyzed by RT-PCR using primers pHCG-F 
and DT-3R. This yielded the predicted 196 bp trans-spliced product in three out of the 
four selected clones (Figure 7A, lanes 2, 3 and 4). The amplified product from clone #2 
was directly sequenced, confirming that PTM driven /raw^-splicing occurred in human 
5 cells exactly at the predicted splice sites of endogenously expressed pHCG6 target exon 1 
and the first nucleotide of DT-A (Figure 7B). 

6.2.8. TRANS-SPLICING IN AN /A^ VIVO MODEL 
To demonstrate the mechanism of /ran5-splicing in vivo, the following . 
experiment was conducted in athymic (nude) mice. Tumors were established by injecting 

10 10^ HI 299 cells into the dorsal flank subcutaneous space. On day 14, PTM expression 
plasmids were injected into tumors. Most tumors were then subjected to electroporation 
to facilitate plasmid delivery (see Table 2, below). After 48 hrs, tumors were removed, . 
poly -A mRNA was isolated and amplified by RT-PCR. Trans-splicing was detected in 8 
out of 19 PTM treated tumors. Two samples produced the predicted /ra«^-spliced 

15 product (466 bp) from mRNA after one round of RT-PCR. Six additional tumors were 
subsequently positive for /raw^-splicing by a second PCR amplification using a nested set 
of primers that produced the predicted 196 bp product (Table 2). Each positive sample 
was sequenced, demonstrating that PHCG6 exon 1 was precisely /ra«5-spliced to the 
coding sequence of DT-A (wild type or CRM mutant) at the predicted splice sites. Six of 

20 the positive samples were from treatment groups that received cotransfected plasmids, 
pcPTM+CRM and pcHCG6, which increased the concentration of target pre-mRNA. 
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This was done to enhance the probability of detecting trans-spliced events. The other 
two positive tumors were from a group that received only pcPTM+Sp (wild type DT-A). 



These tumors were not transfected with pHCG6 expression plasmid, demonstrating once 
again, as in the tissue culture model described in Section 6.2.7, that /rara-splicing 
5 occurred between the PTM and endogenous pHCG6 pre-mRNA produced by tumor cells. 





Table 2. Trans-splicing in tumors in nude mice. 


Mouse 


Plasmid 


Left Right 


Electroporation 


RT-PCR 
Left Right 


Nested PGR 


Nucleotide Sequence 


'Bl 


pCMV-Sport 


Bl-1 Bl-2 










B2 


pCMV-Sport 


Bl-3 Bl-4 


MOOOV/cm 


- 


- 


- 


r>3 


pcSp+CRM 


B3-1 B3-2 


MOOOV/cm 












B3-3 B3-4 


MOOOV/cm 








B4 


pcSp-fCRM 


B4-1 B4-2 


^'SOV/cm 












B4-3 B4-4 


^25V/cm 








B5 


pcSp+CRM/ 
pcHCG6 


B5-1 B5-2 


MOOOV/cm 






ATGTTCCAG 1 GGCGTGATGAT 
(SEQ ID NO:53) 






B5-3 B5-4 


MOOOV/cm 






ATGTTCCAG 1 GGCGTGATGAT 
(SEQ IDNO:53) 


B6 


pcSp+CRM/ 
pcHCG6 


B6-1 B6-2 


*'50V/cm 












B6-3 B6-4 


*=25V/cm 






ATGTTCCAG i GGCGTGATGAT 
(SEQ ID NO:53) 


B7 


pc PTM+Sp 


B7-1 


MOOOV/cm 








B8 


pc PTM+Sp 


B8-1 


^'SOV/cm 




+ 


ATGTTCCAG 1 GGCGTGATGAT 
(SEQIDNO:53) 


'B9 


pc PTM+Sp 


B9-1 






+ 


ATGTTCCAG 1 GGCGTGATGAT 
(SEQ IDNO:53) 



\ 6 pulses of 99//S sets of 3 pulses administered orthogonally 
\ 8 pulses of 10ms sets of 4 pulses administered orthogonally 
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■^r 8 pulses of 50ms sets of 4 pulses administered orthogonally 

positive for RT-PCR trans-spliced produce 
' : did not receive electroporation 



7. EXAMPLE: lacZ r/L4A^S'-SPLICING MODEL 
In order to demonstrate and evaluate the generality of the mechanism of 
spliceosome mediated targeted /raw^-splicing between a specific pre-mRNA target and a 
PTM, a simple model system based on expression of enzyme P-galactosidase was 
5 developed. The following section describes results demonstrating successful splicesomCv 
mediated targeted trans-s^\\cm% between a specific target and a PTM. 

7.1. MATERIALS AND METHODS 
7.1.1. PRIMER SEQUENCES 
The following primers were used for testing the lacZ model system: 

10 5' Lac- IF GCATGAATTCGGTACCATGGGGGGGTTCTCATCATCATC 

5' Lac- 1 R CTGAGGATCCTCTTACCTGTAAACGCCC ATACTGAC 

3' Lac- 1 F GC ATGGTAACCCTGC AGGGCGGCTTCGTCTGGGACTGG 

3' Lac-IR CTGAAAGCTTGTTAACTTATTATTTTTGACACCAGACC 

3' Lac-Stop GCATGGTAACCCTGCAGGGCGGCTTCGTCTAATAATGGGACTGGGTG 

15 HCG-InlF GCATGGATCCTCCGGAGGGCCCCTGGGCACCTTCCAC 

HCG-InlR CTGACTGCAGGGTAACCGGACAAGGACACTGCTTCACC 

HCG-Ex2F GCATGGTAACCCTGCAGGGGCTGCTGCTGTTGCTG 

HCG-Ex2R CTGAAAGCTTGTTAACCAGCTCACCATGGTGGGGCAG 
Lac-TRl (Biotin): 7-GGCTTTCGCTACCTGGAGAGAC 

20 Lac-TR2 GCTGGATGCGGCGTGCGGTCG 

HCG-R2: CGGCACCGTGGCCGAAGTGG 



7. 1.2. CONSTRUCTION OF THE lacZ PRE-mRNA TARGET MOLECULE 



NY02:301604.1 



-63- 



1304B-A-A 072874.0134 



The lacZ target 1 pre-mRNA (pc3.1 lacTl) was constructed by cloning of 
the following three PGR products: (i) the 5' fragment of lacZ; followed by (ii) pHCG6 
intron 1; (iii) and the 3' fragment of lacZ. The 5' and 3* fragment of the lacZ gene were 
PGR amplified from template pcDNA3.1/His/lacZ (Invitrogen,San Diego, GA) using the 
5 following primers: 5' Lac- IF and 5'Lac-lR (for 5* fragment), and 3*Lac-lF and 3' Lac-1 R 
(for 3' fragment). The amplified lacZ 5' fragment is 1788 bp long which includes the 
initiation codon, and the amplified 3' fragment is 1385 bp long and has the natural 5' and 
3' splice sites in addition to a branch point, polypyrimidine tract iand PHCG6 intron 1 . 
The pHGG6 intron 1 was PGR amplified using the following primers: HCG-InlF and . 
10 HGG-InlR. 

The lacZ target 2 is an identical version of lacZ target 1 except it contains 
two stop codons (TAA TAA) in frame four codons after the 3 'splice site. This was 
created by PGR amplification of the 3' fragment (lacZ) using the following primers: 3* 
Lac-Stop and 3' Lac IR and replacing the functional 3* fragment in lacZ target 1. 



15 7.1.3. GONSTRUGTION OF pc3.1 PTMl and pc3.1 PTM2 

The pre-trans-splicing molecule, pc3.1 PTMl was created by digesting 
pPTM +Sp with PstI and Hindlll and replacing the DNA fragment encoding the DT-A 
toxin with the a DNA fragment encoding the functional 3* end of lacZ. This fragment 
was generated by PGR amplification using the following primers: 3* Lac- IF and 3* Lac- 

20 IR. For cell culture experiments, an EcoRI and Hindlll fragment of pc3.1 PTM2 which 
contains the binding domain to HGG intron 1, a 30 bp spacer, a yeast branch point 
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(TACTAAC), and strong polypyrimidine tract followed by the lacZ cloned was cloned 
into pcDNA3.1. 

The pre -trans-splicing molecule, pc3.1 PTM2 was created by digesting 
pPTM +Sp with PstI and Hindlll and replacing the DNA fragment encoding the DT-A 
5 toxin with the pHCG6 exon 2. pHCG6 exon 2 was generated by PGR amplification 

using the following primers: HCG-Ex2F and HCG-Ex2R. For cell culture experiments, 
an EcoRI and Hindlll fragment of pc3.1 PTM2 which contains the binding domain to 
HGG intron 1, a 30 bp spacer, a yeast branch point (TACTAAC), and strong 
polypyrimidine tract followed by the PHCG6 exon 2 cloned was used. 

10 7.1.4. CO-TRANSFECTION OF THE lacZ SPLICE TARGET 

PRE-mRNA AND PTMS INTO 293T CELLS 

Human embryonic kidney cells (293T) were grown in DMEM medium 

supplemented with 10% FBS at 37*^C in a 5% CO2. Cells were co-transfected with pc3.L. 

LacTl and pc3.1 PTM2, or pc3.1 LacT2 and pc3.1 PTMl, using Lipofectamine Plus 

15 (Life Technologies,Gaithersburg, MD) according to the manufacturer's instructions. 24 

hours post-transfection, the cells were harvested; total RNA was isolated and RT-PCR 

was performed using specific primers for the target and PTM molecules, p-galactosidase 

activity was also monitored by staining the cells using a p-gal staining kit (Invitrogen, 

San Deigo. CA). 

20 

7.2. RESULTS 
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7.2.1. THE lacZ SPLICE TARGET C/5-SPLICES EFFICIENTLY TO 
PRODUCE FUNCTIONAL p-GALACTOSIDASE 



To test the ability of the spHce target pre-mRNA to c/^-splice efficiently, 



5 pc3. 1 lacTl was transfected into 293 T cells using Lipfectamine Plus reagent (Life 

Technologies,Gaithersburg, MD) followed by RT-PCR analysis of total RNA. Sequence 
analysis of the c/^-spliced RT-PCR product indicated that splicing was accurate and 
occurred exactly at the predicted splice sites (Fig. 12B). In addition, accurate cis-splicing 
of the target pre-mRNA molecule results in formation of a mRNA capable of encoding 
1 0 active p-galactosidase which catalyzes the hydrolysis of P-galactosidase, z. e. , X-gal, 

producing a blue color that can be visualized under a microscope. Accurate c/^-splicing of 
the target pre-mRNA was further confirmed by successfully detecting p-galactosidase 
enzyme activity. 



1 5 functional 3' lacZ fragment (PTMl) was measured by staining for p-galactosidase 
enzyme activity. For this purpose, 293T cells were co-transfected with lacZ target 2 
pre-mRNA (containing a defective 3' fragment) and PTMl (contain normal 3' lacZ 
sequence). 48 hours post-transfection cells were assayed for p-galactosidase enzyme 
activity. Efficient trans-splicing of PTMl into the lacZ target 2 pre-mRNA will result in 

20 the production of functional p-galactosidase activity. As demonstrated in Figure 1 IB-E, 
trans-splicing of PTM 1 into lacZ target 2 results in restoration of p-galactosidase enzyme 
activity up to 5% to 10% compared to control. 



Repair of defective lacZ target 2 pre-mRNA by trans-splicing of the 
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7.2.2. TARGETED TRANS-S?UCmG BETWEEN 
THE lacZ TARGET PRE-mRN A and PTM2 

To assay for /rara-splicing, lacZ target pre-mRNA and PTM2 were 

transfected into 293 T cells. Following transfection, total RNA was analyzed using RT- 

5 PGR. The following primers were used in the PGR reactions: lacZ-TRl (lacZ 5' exon 

specific) and HGGR2 (pHGGR exon 2 specific). The RT PGR reaction produced the 

expected 195 bp /raw^-spliced product ( Fig. 11, lanes 2 and 3) demonstrating efficient 

trans-splicing between the lacZ target pre-mRNA and PTM 2. Lane 1 represents the 

control, which does not contain PTM 2. 

1 0 The efficiency of the /raw^-splicing was also measured by staining for p- 

galactosidase enzyme activity. To assay for trans-splicing, 293T cells were co- 
transfected with lacZ target pre-mRNA and PTM 2. 24 hours post-transfection, cells 
were assayed for p-galactosidase activity. If there is efficient ^ara-splicing between the 
target pre-mRNA and the PTM, a chimeric mRNA is produced consisting of the 5' 

1 5 fragment of the lacZ target pre-mRNA and pHGG6 exon 2 is formed which is incapable 
of coding for an active P-galactosidase. Results from the co-transfection experiments 
demonstrated that ^aw^-splicing of PTM2 into lacZ target 1 resulted in the reduction of 
p-galactosidase activity by compared to the control. 

To fiirther confirm that trans-splicing between the lacZ target pre-mRNA 

20 and PTM2 is accurate, RT-PGR was performed using 5* biotinylated iacZ-TRl and 
non-biotinylated HGGR2 primers. Single stranded DNA was isolated and sequenced 
directly using HGGR2 primer (HGG exon 2 specific primer). As evidenced by the 
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sequence of the splice junction, /ra/75-splicing occurred exactly as predicted between the 
splice sites (Fig. 12A and 128), confirming that a conventional pre-mRNA can be 
invaded by an engineered PTM during splicing, and moreover, that this reaction is 
precise. 

5 8. EXAMPLE: CORRECTION OF THE CYSTIC FIBROSIS 

TRANSMEMBRANE REGUALTOR GENE 

Cystic fibrosis (CF) is one of the most common genetic diseases in the 

world. The gene associated with CF has been isolated and its protein product deduced 

(Kerem, B.S. et al., 1989, Science 245:1073-1080; Riordan et al., 1989, Science 

10 245: 1066-1 073 ;Rommans, et aL, 1989, Science 245:1059-1065). The protein product of 
the CF associated gene is referred to as the cystic fibrosis trans-membrane conductance 
regulator (CFTR). The most common disease-causing mutation which accounts for 
--70% of all mutant alleles is a deletion of three nucleotides in exon 10 that encode for a 
phenylalanine at position 508 (AF508). The following section describes the successful 

15 repair of the cystic fibrosis gene using spliceosome mediated /r^zw^-splicing and 
demonstrates the feasibility of repairing CFTR in a model system. 



8.1 MATERIALS AND METHODS 
8.1.1. PRE-TRANS-SPLICING MOLECULE 
The CFTR pre-trans-splicing molecule (PTM) consists of a 23 nucleotide 
20 binding domain complimentary to CFTR intron 9 (3' end, -13 to -3 1), a 30 nucleotide 
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spacer region (to allow efficient binding of spliceosomal components), branch point (BP) 
sequence, polypyrimidine tract (PPT) and an AG dinucleotide at the 3* splice site 
immediately upstream of the sequence encoding CFTR exon 10 (wild type sequence 
containing F508). This initial PTM was designed for maximal activity in order to 
demonstrate trans-splicing; therefore the PTM included a UACUAAC yeast consensus 
BP sequence and an extensive PPT. An 18 nucleotide HIS tag (6 histamine codons) was 
included after wild type exon 10 coding sequence to allow specific amplification and 
isolation of the /ra«5-spliced products and not the endogenous CFTR. The 
oligonucleotides used to generate the two fragments included unique restriction sites. 
(Apal and PstI, and PstI and NotI, respectively) to facilitate directed cloning of amplified 
DNA into the mammalian expression vector pcDNA3. ] . 

8.1.2. THE TARGET CFTR PRE-mRNA MINI-GENE 
The CFTR mini-gene target is shown in Figure 13 and consists of CFTR 
exon 9 ; the fiinctional 5* and 3' regions of intron 9 (260 and 265 nucleotides fi"om each 
end, respectively); exon 10 [AF508]; and the 5* region of intron 10 (96 nucleotides). In 
addition, as depicted in Figure 16, a mini-target gene comprising CFTR exons 1-9 and 
10-24 can be used to test the use of spliceosome mediated trans-splicing for correction of 
the cystic fibrosis mutation. Figure 17, shows a double splicing PTM that may also be 
used for correction of the cystic fibrosis mutation. As shown, the double splicing PTM 
contains CFTR BD intron 9, a spacer, a branch point, a polypyrimidine tract, a 3' splice 
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site, CFTR exon 10, a spacer, a branch point, a polypyrimidine tract, a 5' splice site and 
CFTRBD exon 10. 



8.1.3. OLIGONUCLEOTIDES 
The following oligonucleotides were used to create CFTR PTM: 



SForward CF3 





ACCT 


GGGCCC ACC CATTATTAGGTC ATTAT CCGCGGAACATTATA 
Apal site. Intron 9 CFTR, - 1 2 to -34. 


Reverse 

) 


CF4 
ACCT 


CTGCAGGTGACC CTG CAG GAA AAA AAA GAA G 
Pstl. BstEI. PPT. 


Forward 


CF5 
ACCT 


CTGCAG ACT TCA CTT CTA ATG ATG AT 
Pstl . Exon 1 0 CFTR, + 1 to 4 24 


Reverse 


CF6 





1 5 ACCT GCGGCCGC CTA ATG ATG ATG ATG ATG ATG CTC TTC TAG TTG GCA TGC 

Not I. Stop Polyhistamine tag Exon 1 0 CFTR, 4 1 5 to + 1 3 2 



The following nucleotides were used to create the CFTR TARGET pre-mRNA 
mini gene (Exon 9 + mini-Intron 9 + Exon 10 + 5' end Intron 10): 

Forward CF18 

20 GACCT CTCGAGGGATTTGGGGAATTATTTGAG 

Xhol Exon 9 CFTR, 1 to 21. 

Reverse CF19 

CTGACCT GCGGCCGC TAC AGT GTT GAA TGT GGT GC 
NotL Intron 9 5' end. 

25 Forward CF20 

CTGACCT GCGGCCGC CCA ACT ATC TGA ATC ATG TG 
Notl. Intron 9 3' end. 

Reverse CF21 

GACCT CTTAAGTAG ACTAACCGATTGAATATG 
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Aflll Intron 10 5'end. 

The following oligonucleotides were used for detection of trans-spliced products: 

Reverse Bio-His 

CTA ATG ATG ATG ATG ATG ATG 
5 Stop. Polyhistidine tag (5' biotin label). 

Reverse Bio-His(2) 

CGC CTA ATG ATG ATG ATG ATG 

3' UT Stop. Polyhistidine tag (5' biotin label). 

Forward CF8 
1 0 CTT CTT GGT ACT CCT GTC CTG 

Exon 9 CFTR. 

Forward CF18 

GACCT CTCGAG GGA TTT GGG GAA TTA TTT GAG 
Xhol. Exon 9 CFTR. 

IS Reverse CF28 

AAC TAG AAG GCA CAG TCG AGG 

Pc3.1 vector sequence (present in PTM 3' UT but not target). 

8.2. RESULTS 
The PTM and target pre-mRNA were co-transfected in 293 embryonic 

20 kidney cells using lipofectamine (Life rechnologies,Gaithersburg, MD). Cells were 

harvested 24 h post transfection and RNA was isolated. Using PTM and target-specific 
primers in RT-PCR reactions, a trans-spliced product was detected in which mutant exon 
10 of the target pre-mRNA was replaced by the wild type exon 10 of the PTM 
(Figure 14). Sequence analysis of the trans-spliced product confirmed the restoration of 

25 the three nucleotide deletion and that splicing was accurate, occurring at the predicted 

splice sites (Figure 15), demonstrating for the first time RNA repair of the cystic fibrosis 
gene, CFTR (Mansfield et al., 2000, Gene Therapy 7:1885-1895). 
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9. EXAMPLE: DOUBLE-^TVS'-SPLICING 
The following example demonstrates accurate replacement of an internal 
exon by a double-rraw^-splicing between a target pre-mRNA and a PTM RNA containing 
both 3' and 5* splice sites leading to production of full length functionally active protein. 
5 As described herein, any pre-mRNA can be reprogrammed by providing a 

/r<7«5-reactive RNA molecule containing either a 3'-splice site, a 5*-splice site or both. 
The following example describes successful targeting and replacement of a single intemal 
exon utilizing pre-^rara-splicing molecules (PTMs) containing both the 5* and 3' splice 
sites. Such PTMs can promote two /ra«^-splicing reactions with the intended target gene 

10 mediated by the splicesome(s). To test this mechanism, a splicing lacZ model target gene 
consisting of lacZ 5' "exon" - CFTR mini-intron 9 - CFTR exon 10 (AF508) - CFTR 
mini-intron 10 followed by lacZ 3' "exon" was created. In this target transcript, a 124 bp 
central portion of the p-galactosidase ORF was substituted by exon 10 (AF508) of CFTR, 
thus it produces non-functional protein. A PTM consisting of the missing 124 bp lacZ 

15 "mini-exon" and a 5' and 3' trans-splicing domain containing binding domains (BDs) 
complementary to the target introns and exons was created. Transfection of HEK 293T 
cells with either target alone or PTM alone showed no detectable levels of p-gal activity. 
In contrast, 293T cells transfected with target plus PTM produced substantial levels of P- 
gal activity indicating the restoration of protein function. The accuracy of trans-splicing 

20 between the target and PTM was confirmed by sequencing the appropriate RT-PCR 
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product, which revealed the predicted internal exon substitution. The feasibility of this 
approach in a disease model was tested by replacing the CFTR AF508 exon 10 with 
normal exon 10 containing F508 in cystic fibrosis. These results demonstrate that a 
/raw5-splicing technology can be easily adapted to correct many of the genetic defects 
5 whether they are associated with the 5' exon or 3* exon or any intemal exon of the gene. 

Figure 18 is a schematic of a model lacZ target consisting of lacZ 5' exon - 
CFTR mini-intron 9 - CFTR exon 10 (delta 508) - CFTR min-intron 10 followed by the 
lacZ 3* exon. In this target, a 124 bp central portion of the lacZ gene is substituted with 
CFTR exon 10 which has a mutation at position 508 (delta 508). The pre-mRNA target 

10 undergoes normal cz\y-splicing to produce an mRNA consisting of lacZ 5' exon - CFTR 
exon 10 (delta 508) followed by the lacZ 3' exon. Because of the disruption in 
p-galactosidase ORF it produces truncated proteins which are non-functional. 

To restore p-gal function by double-/ra«5-splicing, three PTMs were 
created consisting of the missing 124 bp lacZ "mini-exon" and a 5' and 3' tram-splicing 

15 domain containing binding domains complementary to the target introns and exons as 
shown in Figure 19. These PTMs have an 120 bp 3' binding domain (complementary to 
intron 9) from PTM24 (see below) used in 3* exon replacement, spacer sequence, yeast 
branch point, polypyrimidine tract, 3* acceptor AG dinucleotide, lacZ "mini-exon", 5* 
splice site, spacer sequence followed by the 5' binding domain. These PTMs differ only 

20 in their 5' binding domain sequences. DSPTM5 has a 27 bp BD which is complementary 
to intron 10 and blocks just the 5' splice site of the target. DSPTM6 has 120 bp 5* BD 
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and covers both 5' and 3' splice sites of the target, while, DSPTM7 has 260 bp BD which 
masks both the splice sites (5' and 3') and also covers the entire exon of the target. 

A schematic representation of a double-^awi'-splicing reaction showing 
the binding of DSPTM7 with DSCFT1.6 target pre-mRNA is shown in Figure 20. 3' BD: 
5 120 bp binding domain complementary to mini-intron 9; 5' BD (260 bp); second binding 
domain complementary to mini-intron 10 and exon 10. ss: splice sites; BP: branch point, 
and PPT: polypyrimidine tract. 

The important structural elements of DSPTM7 (Figure 21) are as follows: 

fn3'BDri2Q BP^ : GATTCACTrGCTCCAATTATCATCCTAAGCAGAAGTGTAT 
1 0 ATTCTTATTTGTAAAGATTCTATTAACTCATTTGATTCAA 

AATATTTAAAATACTTCCTGTTTCATACTCTGCTATGCAC 

(2) Spacer sequences f24 bp^ : AACAITATTATAACGTTGCTCGAA 

(3) Branch point, pyrimidine tract and acceptor splice site : 

3'ss 

15 BP Kpnl PPT EcoRV liacz mini-exon 

TACTAAC T GGTACC TCTTCTTTTTTTTTT GATATC CTGCAG | GGC GGC 

(4) 5' donor site and 2"** spacer sequence : 

5' ss 

lacZ mini-exon 1 

20 | TGAACG | GTAAGT GTTATCACCGATATGTGTCTAACCTGATTCGGGCCTTC 
GATACGCTAAGATCCACCGG 

(5) 5' BD r260 BP) : TCAAAAAGTTTTCACATAATTTCTTACCTCTTCTTGAATT 

CATGCTTTGATGACGCTTCTGTATCTATATTCATCATTGG 
AAACACCAATGATTTTTCTTTAATGGTGCCTGGCATAATC 
25 CTGGAAAACTGATAACACAATGAAATTCTTCCACTGTGC 

TTAAAAAAACCCTCTTGAATTCTCCATTTCTCCCATAATC 
ATCATTACAACTGAACTCTGGAAATAAAACCCATCATTA 
TTAACTCATTATCAAATCACGC 
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To determine whether the restoration of p-gal function is RNA trans- 
splicing mediated, the mutants are depicted in Figure 22. DSPTM8 is a 3* splice mutant 
in which the 3' splice elements such as BP, polypyrimidine tract and the 3' acceptor 
AG dinucleotides were deleted and replaced with random sequences. This PTM still has 
5 3' and 5' binding domains and the functional 5' splice site. PTM29 lacks the 2"^ binding 
domain + 5' ss but still has the 3' binding domain 3' splice site, while PTM30 lacks the 

binding domain + 3' splice site but has the functional 5* splice site and 2*''* binding 
domain. 

To examine the double-/rara-splicing mediated restoration of p-gal 
10 function, 293T cells were either transfected with 2 ixg of target or PTM alone or 

co-transfected with 2 ixg of target + L5 /.ig of PTM using Lipofectamine Plus reagent. 
48 hrs. after transfection, total RNA was isolated and analyzed by RT-PCR using Kl-IF 
and Lac-6R primers. These primers amplify both cis- and /rara-spliced products in a \ 
single reaction which were identified based on the size. The c/5-spliced product is 295 bp 
15 in size while the /raw^-spliced product is 230 bp in size. To confirm that tranS'S^Xicing 
between DSPTM7 and DSCFT1.6 pre-mRNA is precise, RT-PCR amplified products 
were excised, re-amplified using K1-2F and Lac-6R primers and sequenced directly using 
K1-2F or Lac-6R primers. As shown in Figure 23 /ra«5-splicing occurred exactly at the 
predicted splice sites, confirming the precise internal exon substitution by two 
20 ^rara-splicing events. 

The repair of defective lacZ pre-mRNA by double /raw^-splicing events 
and subsequent production of full-length p-gal protein was investigated in co-transfection 
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assays. 293T cells were co-transfected with DSCFT1.6 target and DSPTM7 expression 
plasmids, as well as with DSCFTl .6 target or DSPTM7 alone as controls. Western blot 
analysis of total cell lysates using polyclonal anti-p-galactosidase antiserum specifically 
recognized a - 120 kDa protein only in cells co-transfected with DSCFTL6 target + 
5 DSPTM7 plasmids (Fig. 24, lanes 3 and 4) but not in cells transfected with either 

DSCFTl. 6 target (Lane 1) or DSPTM7 plasmid alone (Lane 2). Similarly, no full-length 
protein was detected in cells co-transfected with DSCFTl. 6 target + 3' splice mutant 
(Lane 5 and 6) or PTM29 or 30 in which either 3* /r^jrn^-splicing domain or 5* 
/raw^-splicing domains has been deleted (Lane 7). In addition, the 120 kDa protein band 

10 co-migrated with the full-length functional P-gal produced using lacZ-Tl plasmid 

(positive control, data not shown). These results not only confirmed the production of 
full-length protein by double-^a/75'-splicing between the target and P TM but also 
demonstrated that both the 3' splice site and 5' splice sites are essential for this process. ^ 
To determine whether the full-length protein produced by double-^aw.v- 

1 5 splicing between the target pre-mRNA and DSPTM7 RNA is functionally active, 293T 
cells were co-transfected with DSCFTl .6 targeted + one of the double splicing PTMs 5, 6 
or 7 expression plasmids, or transfected with DSCFTl .6 target or DSPTM7 alone. Total 
cell extracts were prepared and assayed for p-gal activity using ONPG assay (Invitrogen). 
P-gal activity in extracts prepared from cells transfected with either DSCFTL6 target or 

20 DSPTM7 alone was almost identical to the background levels detected in mock 

transfection (Fig. 25). In contrast, 293T cells co-transfected with DSCFTl. 6 target and 
DSPTM7 produced - 21 fold higher levels of p-gal activity over the background 
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(Fig. 25). These results confirmed the accurate double-Zraw^-spUcing between the target 
pre-mRNA and PTM RNA and production of the full-length functional protein. 

To confirm that restoration of p-gal activity by double-Zran^-splicing 
reaction is absolutely depended on the presence of both 3* and 5' splice sites of the PTM, 
5 we constructed several mutants: (a) DSPTM8, is identical to DSPTM7 except the 

functional 3* spice elements (branch point, polypyrimidine tract and the 3* acceptor AG 
dinucleotides) were deleted and substituted with random sequences (see Fig. 22 for 
details); (b) PTM29 lacks 5' splice site as well as the 5' binding domain but has the 
3* binding domain and 3' splice site, and (c) PTM30 lacks 3' binding domain and 3' splice 

10 site but has the 5' splice site and 5' binding domain, p-gal activity in extracts prepared 
from cells transfected with either DSCFT1.6 target or DSPTM7 alone was almost 
identical to the background levels detected in mock transfection (Fig. 26). Similarly, no 
significant increase in P-gal activity was detected in cells transfected with either 
DSPTM8 alone (3' splice site mutant) or co-transfection of DSCFT1.6 target + one of the 

15 above mutant PTMs. On the other hand, cells co-transfected with DSCFTL6 target and 
DSPTM7 with functional 3' and 5* splice sites produced substantial levels of P-gal 
activity over the background (Fig. 26). These results confirmed the requirement of both 
splice sites in the double-splicing PTM and also eliminated the possibility that restoration 
of p-gal activity was due to complementation between the truncated proteins (Fig. 26). 

20 Different concentrations of the target and PTM were co-transfected and 

analyzed for p-gal activity restoration. As expected, 293T cells co-transfected with 
DSCFT1.6 target + DSPTM7 showed substantial levels of p-gal activity (- 30 fold) over 
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the controls. Increasing the concentrations of the PTM by 2 and 3 fold did increase the 
level of p-gal activity, but not significantly (Fig. 27). These results further confirmed the 
double-/ra«5-splicing mediated restoration of p-gal enzyme function. 

The specificity of double-^rara-splicing reaction was examined by 
constructing a non-specific target (DSHCGTLl) which is similar to that of specific target 
(DSCFTl .6) but has pHCG intron 1 - pHCG exon 2 and PHCG intron 2 instead of CFTR 
mini-intron 9 - CFTR exon 10 (delta 508) and CFTR mini-intron 10 (Fig. 28). RT-PCR 
analysis of the total RNA isolated fi-om cells transfected with either DSHCGTLl (non- 
specific target) alone or in combination DSPTM7 (targeted to DSCFTl. 6 target) failed,to 
produce the expected 3 14 bp double-/raw5~spliced product. On the other hand, RT-PCR 
analysis of the total RNA prepared from cells co-transfected with specific target -i- PTM 
produced the expected 314 pb product. This was further confirmed by p-gal activity , 
assay of the total cellular extract. The level p~gal activity detected in cells transfected.^ 
with non-specific target alone or in combination with DSPTM7 targeted to DSCFTl .6 
target was almost identical to the background level. In contrast substantial levels of p-gal 
activity was detected in cells co-transfected with specific target (DSCFTl. 6) + DSPTM7 
(Fig. 27). These results confirmed that the double-Zrara-splicing is highly specific. 

The repair model in Fig. 30 shows a portion of a target CFTR pre-mRNA 
consisting of exons 1-9, mini-intron 9, exon 10 containing the delta 508 mutation, mini- 
intron 10 and exons 1 1-24 (Fig. 30). The PTM shown in the figure consists of exon 10 
coding sequences (containing codon 508) and two /ram-splicing domains each with its 
own splicing elements (acceptor and donor sites, branchpoint and pyrimidine tract) and a 
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binding domain complementary to intron 9 splice site, part of exon 10 (5* and 3' ends) and 
intron 10 5' splice site (Fig. 31 (DS-CFl)). Exon 10 of the PTM also has modified codon 
usage throughout to reduce antisense effects between exon 10 of the PTM and it's own 
binding domains and for PTMs that have binding domains which are complementary to 
5 exon sequences (Fig. 3 1). A double-rraw5'-splicing event between the PTM and target 
should produce a repaired full-length mRNA. 

Fig. 32 shows the sequence of a single PGR product showing target exon 9 
correctly spliced to PTM 20 exon 10 (with niodified codons) (upper panel), codon 508 in 
exon 10 of the PTM (middle panel) and PTM exon 10 correctly spliced to target exon J l. 
10 (lower panel). The sequence of a repaired target was generated by RT-PCR followed by 
PGR. 



10. EXAMPLE: TRANS^SFUCJNG REPAIR OF THE 

GYSTIC FIBROSIS GENE USING A PTM 

THAT GAN PERFORM 5* EXON REPLAGEMENT 

15 The key advantage of using 5' exon replacement for gene repair are 

(a) it permits replacement of the 5' portion of a gene 

(b) the construct requires less sequence and space than a full-length gene construct. 

(c) PTMs can be produced that lack a polyA signal which should prevent PTM 
translation, and (d) the 5' end can be modified to increase translation. 
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10.1 MATERIALS AND METHODS 
10.1.1 PLASMID CONSTRUCTION 
The CFTR coding sequences (exons I AO) for PTM30 were generated by 
PCR using a partial cDNA plasmid template (61 160; American Type Culture Collection, 
5 Manassas, VA). The /rara-splicing domain (TSD) [including the binding domain, spacer 
sequence, polypyrimidine tract (PPT), branchpoint (BP) and 3* splice site] was generated 
from a PCR product (using an existing plasmid template) and by armealing 
oligonucleotides. The different fragments (the TSD and coding sequences) were then 
cloned into pcDNA3.1(-) using appropriate restriction sites. Oligodeoxynucleotide 
10 primers were procured from Sigma Genosys (The Woodlands, TX). All PCR products 
were generated with either REDTaq (Sigma, St. Louis, MO), or cloned Pfu (Stratagene, 
La Jolla, CA) DNA Polymerase. PCR primers for amplification contained restriction 
sites for directed cloning. PCR products were digested with the appropriate restriction 
enzymes and cloned into the mammalian expression plasmid pc3.1DNA(-) (Invitrogen, 
15 Carlsbad, CA). 

10.2 CELL CULTURE AND TRANSFECTIONS 
Constructs were cotransfected in human embryonic kidney (HEK) 293T or 
293 cells (1.25 X 10^ cells per 60 nmi poly-d-lysine coated dish) using LipofectaminePlus 
(Life Technologies, Gaithersburg, MD) and the cells were harvested 48 h after the start of 

20 transfection. Total RNA was isolated as described in the manufacturers instructions 

c 

(Epicenter Technologies, Inc.). HEK 293T cells were grown in Dulbecco's Modified 

NY02:30I604.1 -80- 



1304B-A-A 072874.0134 



Eagle's Medium (Life Technologies) supplemented with 10% v/v fetal bovine serum 
(Hyclone, Inc., Logan, UT). All cells were kept in a humidified incubator at 37''C and 
5% CO2. 

10.1.3 REVERSE TRANSCRIPTION-POL YMERASE CHAIN 
5 REACTION (TR-PCR^ 

RT-PCR was performed using an EZ-RT-PCR kit (Perkin-Elmer, Foster 

CA). Each reaction contained 0.03 to 1.0 /xg of total RNA and 80 ng of a 5' and 3' 

specific primer in a 40 /ul reaction volume. RT-PCR products were electrophoresed on 

2% Seaken agarose gels. The PTM- and target-specific oligonucleotides used to generate 

1 0 trans-spliced products are 5'-CGCTGGAAAAACGAGCTTGTTG-3' (primer CF93) and 
5'-ACTCAGTGTGATTCCACCTTCTC-3* (primer CFl 1 1), respectively. The PTM- and 
target-specific oligonucleotides used to generate c/\y-spliced products were CFl and 
CF93. The sequence of oligonucleotide CFl is 
5'-GACCTCTGCAGACTTCACTTCTAATGArGATTATGG-3'. 

1 5 The repair model in Fig. 33 shows a portion of a target CFTR pre-mRNA 

consisting of exons 1-9, mini-intron 9, exon 10 containing the delta 508 mutation, mini- 
intron 10 and exons 1 1-24 (Fig. 33). The PTM shown in the figure consists of exon 1-10 
coding sequences (containing codon 508) and a /raw^-splicing domain v^th its own 
splicing elements (donor site, branchpoint and pyrimidine tract) and a binding domain. 

20 Several PTMs have been constructed with different binding domains. Three examples 
are shown in Figure 34. In Fig. 34A the binding domain is complementary to the splice 
site of intron 9 and part of exon 10 (3* end; CF-PTM 1 1). In Fig. 34B the PTM has an 
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extended binding domain which also covers the 5' end of exon 10 and the 3' splice site of 
intron 9 (CF-PTM 20). In the last example (Fig. 34C) the binding domain is the same as 
that shown in panel B except the binding domain extends the full-length of exon 10 (CF- 
PTM 30). In the latter case the PTM exon 10 has modified codon usage to reduce 
antisense effects with ifs own binding domain (Fig. 34). Further examples of binding 
domains are shown in Figure 35. 

Figure 36 shows the sequence of cis- and /raw^-spliced products. The top 
panel of Fig. 36A shows target exon 10 with it's three missing nucleotides (CTT), whilst 
the lower panel shows exon 10 and 11 of the target correctly spliced together. 
Figure 36B is a partial sequence of a single PGR product showing the modified codons in 
exon 10 of the PTM (upper panel), codon 508 in exon 10 of the PTM (middle panel), and 
PTM exon 10 correctly spliced to target exon 1 1 (lower panel), indicating that /raw- 
splicing is accurate. The sequence of the repaired target was generated by RT-PCR 
followed by PGR. 

11. EXAMPLE: PTMs WITH A LONG BINDING 
DOMAIN, WHIGH MAY BE DISGONTIN- 
UOUS, HAVE INCREASED ™A^5-SPLIGING 
EFFIGIENGY AND SPEGIFIGITY 

11.1. MATERIALS AND METHODS 

11.1.1. GELL CULTURE 

Human embryonic kidney cells (293 or 293T) were from the University of 

North Carolina tissue culture facility at Chapel Hill (Chapel Hill, NG). Cells were 



NY02:301604.1 



1304B-A-A 072874.0134 



maintained at 37*'C in a humidified incubator with 5% CO2 in Dulbecco's modified 
Eagle's medium (Life Technologies, Bethesda, MD) supplemented with 10% v/v fetal 
bovine serum (Hyclone, Logan, UT). Cells were passaged every 2-3 days using 0.5% 
trypsin and re-plated at the desired density- Stable cells, expressing an endogenous 
5 mutant /acZpre-mRNA (lacZCF9) were maintained in the presence of 0.5 mg/ml G418 
(Calbiochem, San Diego, CA). 

11.1.2. RECOMBINANT PLASMIDS 
Targets: pc3.11acZCF9, pc3.11acZCF9m, and pc3.11acZHCGlm. , 
pc3 . 1 lacZCF9 encodes for a normal lacZ pre-mRNA was constructed using lacZ coding % 

10 sequences nucleotides 1-1788 as 5' exon, CFTR mini-intron 9 followed by /acZ coding 

sequences nucleotides 1 789-3 1 74 as 3* exon. This is similar to pc3. 1 lacZ- r2 construct ^ 
but without stop codons in the lacZ 3' exon and has CFTR mini-intron 9 instead of 
pHCG6 intron 1 (Fig. 37A). CFTR mini-intron 9 was PCR amplified using plasmid T5 
as template and primers CFIN-9F (5'-CTAGGATCCCGTTCTTTTGTTCTTCACT 

1 5 ATTAA) and CFIN-9R (5^-CTAG GGTTACC GAAGTAAAACCATACTTATTAG. 
restriction sites underlined), digested with BamHl and BstE II and cloned in place of 
BHCG6 intron 1 of pc3.11acZ-T2 plasmid. pc3.11acZCF9m expresses a defective lacZ 
pre-mRNA and is identical to pc3.11acZCF9 but contains two in-frame non-sense codons 
in the 3* exon (Fig. 3 7 A). pc3.11acZHCGlm is a chimeric target, which includes the lacZ 

20 y exon followed by intron 1 and exon 2 of pHCG6. This is similar to pc3.11acZCF9m 
except that it contains exon 2 of PHCG6 in place of mutant lacZ 3* exon. pHCG6 exon 2 
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was PGR amplified using pHCG6 plasmid (accession # X00266) as template DNA and 
primers HCGEx-2F (S'-GCATGGTTACCCTGCAGGGGCTGCTGCTGTTGCTG) and 
HCGEX-2R (5'-CTGAAAGCllGTTAACCAGCTCACCATGGTGGGGCAG, 
restriction sites underlined) digested with BstE II and Hind III and cloned in place of the 
5 lacZ 3 ' exon of pc3 . 1 lacZCF9m. Plasmid pcDNA3 . 1 /HisB//acZ (Invitrogen, Carlsbad, 
CA) was used as DNA template to produce 5' and 3* lacZ exons. The lacZ 5' exon is 
1788 bp long, has an ATG initiation codon, lacZ 3' exon (without stop codons) is 1385 bp 
long and has a transcription termination signal at the erid of the 3' exon. CFTR 
mini-intron 9 and pHCG6 intron 1 are 548 bp and 352 bp hi size, respectively, and both 

10 have 5' and 3' splice signals. Exon 2 of pHCG6 is 162 bp long and has a transcription 
termination signal at the end of the exon. 

Pre-trans-splicing Molecules (PTMs): PTM-CF14 is an identical version 
of pcPTMl with minor modifications in the /r^n^*- splicing domain (Fig. 37B). 
PTM-CF14 is a linear version and contains a 23 bp antisense binding domain (BD) 

1 5 (5'-ACCCATCATTATTAGGTCATTAT) complementary to CFTR mini-intron 9, 1 8 bp 
spacer, a canonical branch point sequence (UACUAAC; BP) and an extended 
polypyrimidine tract (PPT) followed by normal lacZ y exon. PTM-CF22, PTM-CF24, 
PTM-CF26 and PTM-CF27 are identical to PTM-CF14 except they differ in length of the 
BD (Fig. 37B). sPTM-CF18 has a 32 bp BD, sPTM-CF22 and sPTM-CF24 contain the 

20 same BD as PTM-CF22 and PTM-CF24, respectively. In these PTMs, the binding 

domains were modified to create intra-molecular stem-loop structure ("safety") to mask 
the 3' splice-site of the PTM. Different binding domains were produced by PCR 
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amplification using specific primers (with unique Nhe I and Sac II sites) and a plasmid 
containing CFTR mini-intron 9 as template. PGR products were digested with Nhe I and 
Sac II and cloned into a PTM plasmid consisting of spacer sequences, 3* splice elements 
(BP, PPT and acceptor AG dinucleotide) followed by a normal lacZ 3* exon. 

11.1.3. TRANSFECTION OF PLASMID DNAs INTO 293T CELLS 
The day before transfection, 1x10^ 293T cells were plated on 60 mm 
plates coated with Poly-D-ly sine (Sigma, St. Louis, MO) to enhance the adherence of 
cells and grown for 24 hr at 37 °C. Cells were transfected with expression plasmids. using 
LipofectaminePlus reagent according to standard protocols (Life Technologies, Bethesda, 
MD). In a typical co-transfection, 2 //g of pc3.11acZCF9m target and 1 .5 ixg of PTM 
expression plasmids were transfected into cells and for controls (target and PTM alone 
transfections) total DNA concentration was normalized to 3.5 yug with pcDNA3.1 vector. 

Forty -eight hours after transfection the plates were rinsed with PBS, cells 
harvested and total RNA or DNA was isolated using MasterPure RNA/DNA purification 
kit (Epicenter Technologies, Madison, WI). Contaminating DNA in the RNA preparation 
was removed by treating with DNase I, while, contaminating RNA in the DNA 
preparation was removed by digesting with RNase A at 37 °C for 30-45 min. 
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11.1.4. REVERSE TRANSCRIPTION-POLYMERASE 
CHAIN REACTION (RT-PCR) 

RT-PCR was performed as suggested by manufacturer using an EZ rTth 

RNA PCR kit (Perkins-Elmer, Foster City, CA). A typical reaction (50 lA) contained 

5 25-500 ng of total RNA, 1 00 ng of 5' target specific primer (common to cis- and 

rrara-spliced products) (Lac-9F, 5'-GATCAAATCTGTCGATCCTTCC) and 100 ng of 

3' primer (Lac-3R, 5'-CTGATCCACCCAGTCCCATTA, target specific primer for 

m-splicing, and Lac-5R, 5'-GACTGATCCACCCAGTCCCAGA, PTM specific primer 

for rraw5'-splicing), IX reverse transcription buffer (100 mM Tris-HCl, pH 8.3, 900 mM 

10 KCL with 1 mM MnCy, 200 dNTPs and 10 units of rTth DNA polymerase. 

RT reactions were performed at 60°C for 45 min. followed by 30 sec pre-heating at 94°C 

and 25-35 cycles of PCR amplification at 94 °C for 18 sec, annealing and extension at 

60 °C for 1 min followed by a final extension at 70°C for 7 min. The reaction products 

were analyzed by agarose gel electrophoresis. 

15 11.1.5. PROTEIN PREPARATION AND p-GAL ASSAY 

Total cellular protein from cells transfected with expression plasmids was 
isolated by fi^eeze thaw method and assayed for P-galactosidase activity using a P-gal 
assay kit (Invitrogen, Carlsbad, CA). Protein concentration was measured by the 
dye-binding assay using Bio-Rad protein assay reagents (BIO-RAD, Hercules, CA). 
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11.1.6. WESTERN BLOT 
About 5-25 f^g of total protein was electrophoresed on a 7.5% SDS-PAGE 
gel and electroblotted onto PVDF-P membrane (Millipore). After blocking for 1 hr at 
room temperature (blocking buffer: 5% dry milk and 0.1% Tween-20 in IX PBS), the 
5 blot was incubated with a 1 :2500 dilution of polyclonal rabbit anti~p-galactosidase 
antibody for 1 hr at room temperature (Research Diagnostics Inc. NJ), washed 3x with 
blocking buffer and then incubated with a 1 :5000 diluted anti-rabbit HRP conjugated 
secondary antibody. After incubating at room temperature for 1 hr, it was washed 3x in 
blocking buffer and developed using ECLPlus Western blotting reagents (Amersham 
1 0 Pharmacia Biotech, Piscataway, NJ). 

11.1.7. INSITU^-GAL STAINING 
Cells were monitored for the expression of functional P-galactosidase 
using a p-gal staining kit (Invitrogen, Carlsbad, CA). The percentage of p-gal positive 
cells were determined by counting stained vs. imstained cells in 5-10 randomly selected 
15 fields. 

11.1.8. SELECTION OF NEOMYCIN RESISTANT CLONES 
EXPRESSING AN ENDOGENOUS DEFECTIVE lacZ 
PRE-mRNA TARGET 

On day 1,1x10^ 293 cells were plated on 60 mm plates and grown for 

20 24 hr at 37°C. On day 2, the cells were transfected with 2 fxg of pc3. llacZCF9m using 

LipofectaminePlus transfection reagent as described above. 48 hr post-transfection, cells 
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were split (1 :20 ratio) and grown in media containing 0.5 mg/ml G41 8. At the end of 
2 weeks, neomycin resistant colonies were selected, pooled, expanded and maintained 
constantly in the presence of G41 8. 

11.2. RESULTS 

5 A model system was developed that permits facile and versatile analysis of 

spliceosome mediated RNA trans-splicing in cells. The bacterial lacZ gene was split 
with a truncated intron 9 from the Cystic Fibrosis Transmembrane Conductance 
Regulator (CFTR) gene (Figure 37A). This split lacZ gene, when introduced into human 
293T cells, directed the synthesis of a lacZ pre-mRNA that could splice properly. The 

1 0 open reading frame of the lacZ gene was mutated by insertion of two in-frame nonsense 
codons near the 5' end of the second exon (Figure 37A). This lacZ gene is referred to as 
lacZCF9m. In 293T cells, lacZCF9m directs the synthesis of lacZCF9m pre-mRNA, . 
which encodes a truncated P-galactosidase (P-gal) protein that does not have enzymatic 
activity. Cells bearing the lacZCF9m gene are a model system for genetic disorders 

1 5 caused by loss of function mutations. 

Pre-/rart^-splicing molecules (PTMs) were designed to trans-splice with 
lacZCF9m pre-mRNA and repair the mutation caused by the two nonsense codons. 
PTMs were constructed with binding domains spanning 23, 91 and 153 nucleotides (nt), 
which we named PTM-CF14, PTM-CF22 and PTM-CF24 (Figure 37B). The PTM-CF24 

20 binding domain does not bind 153 contiguous nt in the targeted CFTR gene intron 9, but 
rather creates a loop of 47 nt in the target in between two regions of complementary of 27 
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and 126 nt (Figure 37B). These PTMs were predicted to repair the deficiency created by 
lacZCF9m (Figure 37C). 

Semi-quantitative RT-PCR analysis was used to tests the efficiency of 
^ra«5-splicing mediated by PTMs with long target binding domains. Repair of lacZCF9m 
5 transcripts by rraw^-splicing was tested in two different ways: co-transfection of PTM and 
target (lacZCF9m) plasmids or transfection of cells that had been modified to express the 
target as an endogenous pre-mRNA. Co-transfecting plasmids encoding PTMs with the 
lacZCF9m plasmid provided a facile method for screening the former for efficiency. 
PTM-CF22 and PTM-CF24 were approximately 3-fold and 10-fold more efficient,than 

10 PTM-CF14 in a semi-quantitative RT-PCR assay suggesting a significant improvement in 
mRNA repair (Figure 38). Sequencing of the RT-PCR products showed that trans- 
splicing was accurate, resulting in proper ligation of the exons from the target and the 
PTM. Moreover, mutation of key c/^-acting elements in the 3' splice site of the PTMs ^ 
resulted in an abrogation of /raw^-splicing. In these and all other assays described herein 

1 5 controls were carried out to rule out recombination at the DNA level. Thus, repair of the 
lacZCF9m transcripts was a result of targeted RNA /rara-splicing. 

Transfection of PTM-CF14, -CF22 or -CF24 into 293 cells bearing an 
endogenous lacZCF9m gene confirmed that the longer target binding domains provided 
the PTMs with higher efficiency (Figure 38B). It should be noted that similar levels of 

20 RT-PCR /ra«5-splicing specific product were obtained after 30 PCR cycles and 35 cycles 
for PTM-CF24 and PTM-CF14, respectively. The data therefore suggests that PTMs 
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with long binding domains repaired lacZCF9m transcripts at least an order of magnitude 
better than previously described PTMs. 

More than one in ten transcripts of lacZCF9m can be repaired by trans- 
splicing. Quantitative, real-time PGR w^as used to measure the fraction of lacZCF9m 
5 transcripts repaired by PTMs with long binding domains. The co-transfection assay 
described above was used in these experiments. PTM-CFM, which contains a binding 
domain of 23 nt, was shovm to repair between 1 .2 and 1 .6% of lacZCF9m RNAs in 293 T 
cells and 2.1% of lacZCF9m RNAs in the H1299 human lung cancer cells. PTM-CF24, 
which has a 153 not long binding domain, was significantly more efficient, correcting 

10 between 12.1 and 15.2% of lacZCF9m RNAs in 293T cells and 19.7% in H1299 cells. 
This in effect resulted in a measurable reduction in the levels of lacZCF9m mRNA. 
These data also confirmed the remarkable capability of this RT-PCR assay to distinguish 
between the products of c/^-splicing, the lacZCF9m and mRNA, and the products of 
^a«5-splicing, repaired lacZCF9m mRNA. This is the first true quantification of the 

15 efficacy of rra«^-splicing mediated mRNA repair at the RNA level. These data confirm 
the suggestions of the semi-quantitative RT-PCR analysis shovm above. Similar 
experiments were carried out using 293 cells that express an endogenous lacZCF9m pre- 
mRNA target. Consistent with the data shown above, PTM-CF24 was ten times more 
efficient than PTM-CF14, with the former correcting between 1.3 and 4.1% of 

20 endogenous lacZCF9m transcripts. These data confirmed that increasing the length of the 
PTMs provided a remarkable enhancement in trans-splicing efficiency. 
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Traw^-splicing mediated mRNA repair results in the synthesis of active 
p-galactosidase. At the cellular level, the ultimate criterion for the success of mRNA 
repair is the production of an active protein. Using a western assay it was determined that 
full-length p-gal was produced as a result of trans-splicing . Full-length p-gal was not 
5 observed following transfection of 293T cells with plasmids encoding lacZCF9m or 

PTM-CF24. Co-transfection of both plasmids, however, resulted in robust production of 
full-length p-gal protein, which was readily detectable using anti-p-gal antiserum 
(Figure 39). This result complements enzymatic activity data suggests that the latter was 
not due to a complementation by truncated p-gal proteins. The Western blot artalysis . 

10 revealed that full-length p-gal protein was made in 293T cells by /ran^-splicing and 

furthermore confirmed that the PTMs with long binding domains were efficiently spliced. 

Appropriate repair of p-gal mRNA and synthesis of full-length p-gal 
protein should lead to the production of active enzyme. Indeed, 293T cells co-transfected 
with lacZCF9m and PTM-CF24 were shown to have p-gal activity measured either in situ 

1 5 (Figure 40A) or in extracts (Figure 40B). This activity was shown to depend on the 

rran^-splicing between the target pre-mRNA and the PTM. The quantitative in solution 
assay further confirmed the data presented above: PTM-CF22 and PTM-CF24 were 2.9 
and 9.3 fold more efficient respectively than PTM-CF14. Most impressive, however, 
were results using 293 cells that harbor lacZCF9m as a stable endogenous gene. When 

20 these cells were transfected with PTM-CF14 the levels of p-gal activity obtained were 
barely above backgroxmd. Transfection with PTM-CF24, however, resulted in a 
considerable level of p-gal activity (Figure 40C). This was paralleled by the appearance 
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of full-length P-gal protein. These data demonstrate a sizeable increase in the efficiency 
of /raw^-splicing to repair a mutated pre-mRNA. In fact all prior reports of repair of 
endogenous RNA in mammalian cells by either group I ribozymes or /raw^-splicing have 
been only documented using RT-PCR, an indication of the low level of repair. 

PTMs with very long binding domains are highly specific. It was shown 
that a secondary structure within the binding domain could enhance specificity of PTMs 
in HeLa nuclear extracts. In order to ascertain the specificity of the /raw^-splicing 
reactions in vivo a second target gene was prepared, which could serve as reporter of non- 
specific reactions. This gene, which is referred to as lacZHCGlm, shares the first ^exon 
with lacZCF9m. The intron in lacZHCGlm is intron 1 of the p-subunit of the human 
chorionic gonadotropin gene 6 (phCG6) and the second exon is exon 2 of the same gene. 
lacZHCGlm drives the synthesis of a pre-mRNA that is spliced correctly to yield a 
chimeric mRNA that does not encode a full-length p-gal (see below). PTM-CF14, -CF22 
and -CF24 are not targeted to lacZHCGlm pre-mRNA since there is no complementarity 
between the binding domains in these PTMs and the target gene. Any /raw^-splicing 
between these PTMs and lacZHCGlm pre-mRNA is therefore non-specific (Figure 41A). 

293T cells were transfected with PTM-CF14, -CF22 or -CF24 and the 
level of non-specific /raw5'-spUcing was determined by RT-PCR and by in solution p-gal 
assays. Semi-quantitative RT-PCR suggested that PTM-CF24 was significantly less 
likely than PTM-CF14 to /raw^-splice with lacZHCGlm pre-mRNA. Measurement of p- 
gal activity confirmed this; cells co-transfected with lacZHCGlm and PTM-CF24 
produced 3.7 fold less p-gal than those co-transfected with lacZHCGlm and PTM-CF14 
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(Figure 41C). Based on these data it was estimated that PTM-CF24 is 50 times more 
likely to trans-splice to its target than to a non-specific target. A "safety" version of 
PTM-CF24, sPTM-CF24, did not confer further specificity (Figure 41C). Nonetheless, 
for PTMs with shorter binding domains a "safety" stem involving the binding domain 
5 was seen to improve specificity in vivo (Figure 41C). It was concluded from these data 
that the longer binding domains resulted in PTMs that were not only more efficient but 
also more specific. 

The observation that long binding domains increased the specificity of 
PTMs suggested that very long binding domains (>200 nt) could further enhance , 

10 discrimination. Plasmids encoding PTM-CF26 and -CF27, which have binding domains 
that span 200 nt and 41 1 nt respectively, were constructed and co-transfected with 
lacZHCGlm plasmid. Non-specific /raAj^-splicing of these two PTMs was barely 
detectable with RT-PCR (Figure 4 IB). As measured by the p-gal assay PTM-CF26 and - 
CF27 had minimal non-specific /r^fra-splicing activity (Figure 41C). In a specific trans- 

15 splicing reaction with lacZCF9m as measured by the solution p-gal assay PTM-CF26 was 
as acfive as PTM-CF14 (Figure 4 IB). It was estimated that PTM-CF26 is 80 times more 
likely to trans-splice to the specific target (lacZCF9m) than to a non-specific target 
(lacZHCGlm). Therefore, inclusion of very long binding domains confers to these PTMs 
very high specificity. 

20 The present invention is not to be limited in scope by the specific 

embodiments described herein. Indeed, various modifications of the invention in addition 
to those described herein vsdll become apparent to those skilled in the art from the 
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foregoing description and accompanying Figures. Such modifications are intended to fall 
within the scope of the appended claims. Various references are cited herein, the 
disclosure of which are incorporated by reference in their entireties. 
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